Gene Athe_1915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1915 
Symbol 
ID7407328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2020303 
End bp2021319 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content40% 
IMG OID643716287 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_002573776 
Protein GI222529894 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID[TIGR01727] oligopeptide/dipeptide ABC transporter, ATP-binding protein, C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0882054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCTGAAA AATTGTTAGA AGTAAAGAAT TTGAAAACGT CATTTTTTAC TCATGTTGGA 
GAGGTTAAGG CAGTAAACGA TGTTTCGTTT GATGTATATG AGGGACAGAC TGTAGGTATT
GTAGGTGAAT CTGGAAGCGG TAAAAGTGTT ACCTCAATGT CCATCATGAG ACTTATTGCA
CCGCCTGGAA AAATAGTTGA CGGTCAGATA ATATTTGAAG GTAAGGACTT ACTTAAACTT
TCTGAAAAAG AAATGAGAGA CATAAGAGGA AACAAAATCA GTATGATATT TCAGGACCCT
ATGACATCTT TAAATCCTGT TTTTACAATC GGAAATCAGT TAATAGAAGC GATAAAGATA
CACAACAAAG TTTCAACAGC TCAGGCTAAA AAAAGAGCGG TTGAGATGTT AAAGCTTGTT
GGTATTCCAA GTCCTGAGCG AAGACTTTCA CAGTACCCGC ACGAGTTTTC GGGTGGTATG
CGCCAGAGAG TTATGATAGC GATGGCTCTT TCGTGCAACC CCAAGCTTTT AATTGCAGAT
GAGCCAACAA CTGCACTTGA CGTTACCATC CAGGCTCAGA TATTGGACCT TTTGAAAAAA
CTTCAACAGC AGCTTAAGAT GTCAATTATA CTTATTACTC ATGACCTTGG TGTTGTGGCA
GACATATGCC AAAAGGTGAT TGTAATGTAT GGTGGAATTA TTGTAGAGGA AGGAACTGTT
GATGATATAT TTTACAACCC CAAGCATCCG TATACATGGG GGCTTTTGAG GTCTGTTCCC
AAGATGCATT TAGGGCTTAA AAAAAGGCTT GTGCCAATAG AAGGACAGCC ACCAGATTTA
TTGAAGCCTC CAAAAGGATG TCCGTTTGCG CCAAGATGTG AATATGCAAT GAGAGTGTGT
TTGGAAGTAA GACCACCACT TTTCGAAGTT GAGGATGGTC ATATGTCAAG GTGCTGGCTT
AATCACCAGT ATGCTCCGCA GAGCTTGCTG GAAAAGGCAA AAGCTGCAAA TGAATAA
 
Protein sequence
MAEKLLEVKN LKTSFFTHVG EVKAVNDVSF DVYEGQTVGI VGESGSGKSV TSMSIMRLIA 
PPGKIVDGQI IFEGKDLLKL SEKEMRDIRG NKISMIFQDP MTSLNPVFTI GNQLIEAIKI
HNKVSTAQAK KRAVEMLKLV GIPSPERRLS QYPHEFSGGM RQRVMIAMAL SCNPKLLIAD
EPTTALDVTI QAQILDLLKK LQQQLKMSII LITHDLGVVA DICQKVIVMY GGIIVEEGTV
DDIFYNPKHP YTWGLLRSVP KMHLGLKKRL VPIEGQPPDL LKPPKGCPFA PRCEYAMRVC
LEVRPPLFEV EDGHMSRCWL NHQYAPQSLL EKAKAANE