Gene Athe_1955 details

Gene Information

Locus tag: Athe_1955
Symbol:
ID: 7407369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2068141
End bp: 2069247
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 31%
IMG OID: 643716327
Product: spore germination protein
Protein accession: YP_002573815
Protein GI: 222529933
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00912] spore germination protein (amino acid permease)
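
The coordinate and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers chosen for illustration; they are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2068141
END_BP = 2069247


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS, assuming it ends in exactly one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1107
print(protein_length(length_bp))  # 368
```

Both results agree with the reported Gene Length (1107 bp) and Protein Length (368 aa).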


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.226978
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAATAA ACGACAATGA CAAAATTTCC AGTTTCCAGT GTTTTGTGTT GTTGGTATCT 
GTCATGATAG GAATTGGTAT AATGTTTATG CCTGCGTCTG TTGCAAAAGC ATCAGAACAA
AATGGATGGT TTGTTGTGAT GCTTGGTGGA ATATTATCTT TTTTAGTTTT TCTTCTAATA
TTTAGAGTTA CTATGATAAA CCCTGATTTG ACTATTATTG AACTTTTAAA TAAAGCGTTT
GGGAAGATTT TGGGAACTGT ACTTTCATTT ATATATGTAG TTTACTTTGT AATATTTTCT
GCTTTTGAAA CAAGGCTTAT TGCAGAGACA GCCAAGGAAT TTTTATTTAC ACTCACACCG
AATGAAGTTT TGATTATTTC ATTTCTTTTT ACTTGTGCTT ATATTTCAAG ATATGGAATA
GAAGTAATTG CACGCATGTG TGAAGTTTTA ATGCCGGGGA TTGTGCTTAT AATTATTGTT
TTGAGCTTTT TTGTATACCA AAGGCTTGAC TTTTCAAATC TTCTTCCTGT TTTAAACATT
CCTTTTACAA AACTTATTAA GGGTATTGGT ACAACAATTT TTAGCTTTCT TGGTTTTGAA
GTATTTTTAT TTTTTATGCC ATATGTGAGG CGAAAGGACA AGCTAATCAA GAGTGCTTTT
TTTGGATTTC TTGTTACGAT TTTGCTCTAC GAGGTTATAG TAATCTTTGC GACAGCTGAT
TTTGGTTCAA AAGTAGTACA AACCATGGTA TGGCCGACAC TGAATCTTTT TAGAGATGTG
ACAGTTTTGG AAATAGTTAT TGAAAGACCT GAAAGTATTG TTGTTTCCCT ATGGATGATT
ACAACCTACA CCACAGAGAT TATATTTTTA ATGACAACAG GGTTGATTTT GGCAAGGATT
TTTAACACGA AAGAACATAA TTTCTTTGTA TTCATTCAGC TGCCATTTAT TTATATATTA
TCATTGATAC CACAAAATAT AGTAGAGACA CAAAGATTTA TGGATTATTT TAGCTACTTT
TTTGCGTCTT TTACTGTATT GTTATTACCT TTAATAACTT TCGTTACTCT CTCAATCAAA
AAGAAGGTGA AAAAATATGA AACATAG
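
A GC content figure such as the one reported in the Gene Information fields can be recomputed from a sequence printed in blocks of ten bases. The sketch below is one minimal way to do that; `gc_percent` is a hypothetical helper name, and the way the database rounds its reported value is not stated in this record. The example call uses only the first line of the gene sequence; the full sequence can be pasted in its place.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First line of the gene sequence above, spacing left as printed.
first_line = "ATGATAATAA ACGACAATGA CAAAATTTCC AGTTTCCAGT GTTTTGTGTT GTTGGTATCT"
print(f"{gc_percent(first_line):.1f}%")
```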
 
Protein sequence
MIINDNDKIS SFQCFVLLVS VMIGIGIMFM PASVAKASEQ NGWFVVMLGG ILSFLVFLLI 
FRVTMINPDL TIIELLNKAF GKILGTVLSF IYVVYFVIFS AFETRLIAET AKEFLFTLTP
NEVLIISFLF TCAYISRYGI EVIARMCEVL MPGIVLIIIV LSFFVYQRLD FSNLLPVLNI
PFTKLIKGIG TTIFSFLGFE VFLFFMPYVR RKDKLIKSAF FGFLVTILLY EVIVIFATAD
FGSKVVQTMV WPTLNLFRDV TVLEIVIERP ESIVVSLWMI TTYTTEIIFL MTTGLILARI
FNTKEHNFFV FIQLPFIYIL SLIPQNIVET QRFMDYFSYF FASFTVLLLP LITFVTLSIK
KKVKKYET
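
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The sketch below assumes the amino acid assignments of translation table 11, which match the standard genetic code (the two tables differ in which codons may act as start codons). `translate` and `CODON_TABLE` are hypothetical names used for illustration, and the function stops at the first stop codon.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and final 27 bases of the gene sequence above.
print(translate("ATGATAATAA ACGACAATGA CAAAATTTCC"))  # MIINDNDKIS
print(translate("AAGAAGGTGA AAAAATATGA AACATAG"))     # KKVKKYET
```

The two fragments reproduce the first ten and last eight residues of the protein sequence listed above.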