Gene Athe_2380 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2380 
Symbol 
ID7407799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2531442 
End bp2532785 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content38% 
IMG OID643716743 
ProductGalacturan 1,4-alpha-galacturonidase 
Protein accessionYP_002574222 
Protein GI222530340 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTACA ATGTTTGTAA TTTTGGTGCG AAAGGAAACG GAGTAGATAA AGACACAGAA 
GCGTTTAAAA AAGCAATTGA GGTATGTGAA AAAAACGGTG GTGGAACAGT TTATGTTCCC
GCTGGTATTT ATCATATAGG TGCACTACAT CTTAAAAGTA ACATGACACT TTATATTGAA
AGCGGTGCTG TGCTGAAATT TTCACAGGAT GAAGAAGACT ATCCACTTGT ATACACAAGA
TGGGAAGGCG AAGAGATGCA GGTTTACTCT CCACTGATAT ATGCAGAAGA TGCAGAAAAT
GTTGCAGTAG TGGGGTTTGG CACAATTGAT GGACAGGGCG AAAAGTGGTG GCGTCTTCAC
AGAAATAAAG AGTTAAAATA TCCAAGACCT CGTTCTATCT GTTTTTACAG GTGTAACAAC
GTTACTATCG AAGGAATAAA GATTATAAAC TCTCCAAGCT GGACGGTAAA TCCCATAGAG
TGCCAGAATG TTACAGTTCA CAATATAAAG ATTCAAAATC CATATGATTC ACCAAACACA
GATGGGATAA ATCCGGAGTC ATGTAAAGGC GTCAGAATAT CAAACTGCTA CATAGATGTT
GGTGATGATT GTGTGACGCT AAAGTCTGGA ACAGAAGACT GCAAAGAAAG AATACCTTGT
GAGAATATTA CAATAACAAA CTGTATAATG GCACACGGTC ATGGCGGTGT TGTTATCGGA
AGTGAGATGA GTGGCGGTGT TCGGAATGTT GTTATTTCAA ACTGTATTTT TGAAGGCACA
GACAGAGGAA TAAGAATAAA GACAAGAAGA GGTCGTGGAG GAGTTGTTGA GGATATAAGA
GTTTCGAACA TTGTAATGAA AAATGTTATG TGCCCATTTG CATTTTACAT GTATTACCAT
TGCGGTAAAG GTGGAAAAGA AAAGAGAGTT TGGGATAAGT CTCCATATCC TGTTGATGAT
ACAACACCAG TTGTTAGAAG AATATATATA AGTGATGTTG TTGTAAGGCA GGCAAGAGCA
GCAGCAGGAT TTTTATATGG ACTTACAGAG ATGCCAATTG AGGATGTTGT GTTTTCTAAT
GTTACTGTTG AGATGGCTCA AAACCCTGAA CCAGAACTTC CAGCAATGAT GAGTTATTTA
GAGCCAATGG CTAAAAGAGG GTTTGTCATA AATACTGTAA AGAACATAAG ATTTATGAAT
GTTACTGTAC TGGAACAGGA AGGTGCTGCT TTTGAACTTA ACAATTGTGA GAATGTAGAG
TTTTACAGAT GCAGGGCAAA AGATACAGCA GATTATTCTA AGATTTTGAG TCTGAACAAT
ACAATGAATC TGATTGCTGA GTAG
 
Protein sequence
MIYNVCNFGA KGNGVDKDTE AFKKAIEVCE KNGGGTVYVP AGIYHIGALH LKSNMTLYIE 
SGAVLKFSQD EEDYPLVYTR WEGEEMQVYS PLIYAEDAEN VAVVGFGTID GQGEKWWRLH
RNKELKYPRP RSICFYRCNN VTIEGIKIIN SPSWTVNPIE CQNVTVHNIK IQNPYDSPNT
DGINPESCKG VRISNCYIDV GDDCVTLKSG TEDCKERIPC ENITITNCIM AHGHGGVVIG
SEMSGGVRNV VISNCIFEGT DRGIRIKTRR GRGGVVEDIR VSNIVMKNVM CPFAFYMYYH
CGKGGKEKRV WDKSPYPVDD TTPVVRRIYI SDVVVRQARA AAGFLYGLTE MPIEDVVFSN
VTVEMAQNPE PELPAMMSYL EPMAKRGFVI NTVKNIRFMN VTVLEQEGAA FELNNCENVE
FYRCRAKDTA DYSKILSLNN TMNLIAE