Gene Athe_1917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1917 
Symbol 
ID7407330 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2022362 
End bp2023282 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content36% 
IMG OID643716289 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_002573778 
Protein GI222529896 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000794188 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCAAGAT ACATACTCAA AAGGATAGTG TGGTCTATTG TATCATTATT CGTCATAGTT 
ACTGTTACAT TTTTTCTTAT GAGAATGATA CCAGGTGGTC CATTCACGGG TGAAAAGACT
TTGCCTGAGC AGATTTTGCA AAACCTGAAC GAGAAATATG GACTTAACAA ACCGCTTGGA
GTGCAATATT TCAAATATTT AAATAGCCTT TTACACGGTG ATTTGGGAAT TTCAATGAGA
AATCAAGGTA GAACAGTTAA TGAGATTATT GCAGAAACGT TTCCCATTTC TGCTAAGGTG
GGTATTTTGG CTATAATTTT GAGTTTGCTG ATAGGGATAC CGCTTGGTAT CTGGTCTGCC
GTACATCAAG GTAAATGGCA GGATAATTTG TCTATGATTA TAGCAACCAT TTTCATTACG
ATACCTGGAT TTGTACTTGC TGTAATTTTA ATGTATATCT TTGGTGTAAA GCTTCAACTT
GTACCTATAA TGGGATTAGA TGAACCTAAA AGCTATGTTC TTCCTGTTGT TACACTGGCA
GCATATCCAA TATCTTTTAT TGCAAGGCTT ATTCGAAGCA GTATGCTTGA AAGTTTATCA
CAGGACTATA TTAGAACTGC ACGCGCAAAA GGACTTTCAG ATTTCATAGT CATATACAAA
CATGCGCTGA AAAATTCTTT GATACCTGTT GTTACGTATT TGGGTCCTTT AATTGCAGGT
ATACTTACTG GTAGTTTTGT TGTTGAAAAG ATTTTCTCAA TCCCAGGAAT GGGGAGGTTC
TATGTTGATA GTATATCTAA CAGGGACTAT TCGCTTGTGA TGGGAACCAC AATATTTTAT
GCAGCATTTT TGATATTTAT GAACCTAATT GTTGACATTA TCTATGTATT TATAGACCCG
CGTATAAAAC TTGAGGACTG A
 
Protein sequence
MARYILKRIV WSIVSLFVIV TVTFFLMRMI PGGPFTGEKT LPEQILQNLN EKYGLNKPLG 
VQYFKYLNSL LHGDLGISMR NQGRTVNEII AETFPISAKV GILAIILSLL IGIPLGIWSA
VHQGKWQDNL SMIIATIFIT IPGFVLAVIL MYIFGVKLQL VPIMGLDEPK SYVLPVVTLA
AYPISFIARL IRSSMLESLS QDYIRTARAK GLSDFIVIYK HALKNSLIPV VTYLGPLIAG
ILTGSFVVEK IFSIPGMGRF YVDSISNRDY SLVMGTTIFY AAFLIFMNLI VDIIYVFIDP
RIKLED