Gene Athe_2243 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2243 
Symbol 
ID7407662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2379816 
End bp2381705 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content37% 
IMG OID643716609 
Productamino acid transporter-like protein 
Protein accessionYP_002574088 
Protein GI222530206 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTTGAGA AGATCAAAAG AATTCTCATA GGAAAACCTC TACCAAACGA AGCTGAAAAG 
ACAGAAAAGT ATGGTGTGCT GTGGGGCCTT CCCATTTTGT CAAGCGATGC TATATCATCT
GTTGCATACG CTGGACAAGA AATTTTGTAT GTCCTCTTGC CTGCAATAGG ATTTCTTGCT
TTTAAAGAAA TATCTGTTGT AACTACCGCA ATTATAATAC TTCTTTTTAT TTTAATGCTC
TCTTACAGAC AAACTATTGA AAACTATCCA AATGGCGGCG GTGCTTTCAT TGTCGCGAAG
GACAACCTCG GCGTTTTGGC AGGCATTGTG GCAGGAGCAG CTTTGTCAGT TGACTACATC
CTGACAGTGG CAGTTTCTAT TTCTTCAGGT GTTGACCAGA TTGTAACCGC ACTTGAATTT
TTAAAGCCTT ATAAGATTGC CCTGTGTCTA TTTTTAGTTC TATTTTTGAT GATTGGAAAT
TTGCGAGGAA TAAGAGAATC ATCCAGAATA TTTGGTATAC CTGCTTATGC TTTTATGTTT
TCAATTTTGG CTTTAATTAT AGGGGGATAT ATAAAGCTAA AGATGGGATA TGTTCCGCCA
GAGCCAAAGA TAAACTATTC CTCTCATCCT GTTACTTTAG TTTTGCTTTT GAAGGCATTT
GCAAGCGGGT GCACAGCACT GACTGGTATT GAAGCTGTAT CAAATGCTGT TCCAAACTTT
AAAGACCCGG CAACAAAACA TGCTAAAACT GTTTTGCTTT TATTATCTTT AATAATCCTC
GTGTTATTTG GCGGAACAAC ACTTTTGGCA ACACACTACC ATATAGTTCC TACTGAAGGT
GCCATGCTTG TTTTGATGGC GCAAGAAATT TTTGGCAAAA GCTTTATGTA TTACATTGTT
GCTGCAACAA CCTTTATAAT CCTTGTTTTT GCTGCAAACA CTGCATTTTC GGGTTTCCCA
ATGCTTGTCG CTGTCATGGC AAAAGAAGAA TTTGTTCCAA GGCAGCTCAG CCTTAAAGGT
GATAAGCTTA GCTACTCAAA TGGAATTATC ATCCTTGCTT TGGTATCCGC CCTTTTGATT
GTAGCATTTA ATGGTAACGT CACAGCACTC ATTGGACTTT ATGCGGTTGG TGTTTTTATA
TCATTTACAC TATCTCAATC AGGTATGTTT GTAAGATGGC TTAAGAACAA AGGCAGTCAT
TGGCACATAA AAGCATTTAT AAACGGATTT GGTGCACTCA CAACATTTGT TGTTGTCATT
ATAATTGCAA TAACTAAATT CAAAGAAGGT GCGTGGATTG TTGTTCTTTT AATTCCTCTA
CTTGTCTTTG CAATGATAAA GGTAAAGCTT CATTATATTG CCGTTGCCGA CCAGCTGAGA
GTAACGCCTG AAGATAAAGA ACTTTTGGAT GTTGAACACA ATATATATAG AAACAGGGTT
ATTGTTCCAA TTGAAAGTTT AAACAAGGCA AGCATTCGTG CTTTGAGATT TGCAAAGACA
ATCTCAGACA ATGTAATTGC TTTTAATGTA TCAATAAATG AAGAGCAGGC AAAAAAGCTC
CAAGAAAGAT ATAAAATGCT CAACTGCTCC ATCCCACTTG TAATAAGGTA TTCACCTTAT
AGAAAAATAT TAGAACCGCT TTTAGAGTAT ATAAAGTCAG AAGAGTACAA CTATCAAAAG
GGGGACATGA TAACAGTCAT CATACCCAAA TTTACCGTAC AGCGCTGGTG GCAAAAGATT
TTGCACAACC ATACATGGCT GTTTATAGAA AAAGAACTTT TAAAACACAA GCACATTGTT
GTATCTGTCA TGCCACTGCA GCTGAAAGAT GATGATGTGG TTTTGAAGAA AAAGAACAAA
CCTTTGTGGA AGATATTAGA GGAAGACTAA
 
Protein sequence
MFEKIKRILI GKPLPNEAEK TEKYGVLWGL PILSSDAISS VAYAGQEILY VLLPAIGFLA 
FKEISVVTTA IIILLFILML SYRQTIENYP NGGGAFIVAK DNLGVLAGIV AGAALSVDYI
LTVAVSISSG VDQIVTALEF LKPYKIALCL FLVLFLMIGN LRGIRESSRI FGIPAYAFMF
SILALIIGGY IKLKMGYVPP EPKINYSSHP VTLVLLLKAF ASGCTALTGI EAVSNAVPNF
KDPATKHAKT VLLLLSLIIL VLFGGTTLLA THYHIVPTEG AMLVLMAQEI FGKSFMYYIV
AATTFIILVF AANTAFSGFP MLVAVMAKEE FVPRQLSLKG DKLSYSNGII ILALVSALLI
VAFNGNVTAL IGLYAVGVFI SFTLSQSGMF VRWLKNKGSH WHIKAFINGF GALTTFVVVI
IIAITKFKEG AWIVVLLIPL LVFAMIKVKL HYIAVADQLR VTPEDKELLD VEHNIYRNRV
IVPIESLNKA SIRALRFAKT ISDNVIAFNV SINEEQAKKL QERYKMLNCS IPLVIRYSPY
RKILEPLLEY IKSEEYNYQK GDMITVIIPK FTVQRWWQKI LHNHTWLFIE KELLKHKHIV
VSVMPLQLKD DDVVLKKKNK PLWKILEED