Gene Athe_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2183 
Symbol 
ID7408376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2312099 
End bp2313454 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content37% 
IMG OID643716548 
Productprotein of unknown function UPF0027 
Protein accessionYP_002574031 
Protein GI222530149 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00364968 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA TAAGAGACGG TGTTTATACA AACGATTATG CCATATTCTT CATGACAGAA 
GAAATTTTAA AGGACCTTGA CGAAGGGGTG CTCCAGCAGG CAAAAAACGC ATCCCAAATT
CCAAATGTAG AATTTTTGGG CTATACACCA GATGCACACA TAGGCAAAGG TACTTCAATT
GGCACAATAA TCGTTTGGGA CATGTCAAAG GCGTGGATTT CACCAACAAT TGTTGGTGTT
GACATAGGTT GTGGTATGAG ACTGATTCTG ACAGACAAGT TTGCAGATGA TATAGATAAA
GCACTTTTGA AGAAAATAAT GGATGAGGTA GAAGATTTGA TTCCAACAGG TGTTGGTAAG
AAAAACAAAA AGATAGCTCT TTCCAAGACA AAGTATGAAG AGTATCTTCA AAATACAGAG
ATTGATAAGG ACATTTCAGA CAAGATGGTT CTCATTCATG AGTTTGACCT TGACACAATA
CCGGATGAGG CTCATGAGAT TGGTAAAGAG CAATTTGCAA CCTTGGGTGG AGGCAACCAC
TTTATAGAGT TTCAAAAACT TCATGTCATA GATAAAATTA TTGCAGAAAA ATGGGGACTT
TTCGATGGGC AGTTTGTTGT GATGATACAT TCTGGTTCGA GAAGGTTTGG AGCGGTCATT
GGCGATTATT ATCAAAAGAA ATTTAAAGAC GTTATGAAAT CCAAGGGTAT CACTACGCCA
GACCCGCAGC TTACCTTTTT GCCAATTGAC AACAAGGTTG CAAAAGATTA TATTAAAGCT
ATGCAGTCAG CAGCTATTTA TGCAAAAATA AATAGACATT ATATGAGCAA CTTTATAATA
TCAGTCTTAG AAAAACACTC AATTGACGCT TGGGTTTTAT ATGACGTTGC ACATAACATT
GCATACATGG AAAGATTTGC AAACAGAGAA AAGCTTGTTA TAAGAAAAGG GGCAACAAGA
GCATTACCGC CAAACCACTA TTTGATTCCG AATCCTAAAT TTGCTGAGAC AGGACATCCT
GTGATTTTAC CTGGCAGTAT GGGTTCAAGT TCATATCTTA TGAGGGGAAT TGAGGACAAT
ATAATAAGTT ATCATACAGT CAACCATGGA GCAGGCAGGG TTTTATCACG AACAAAGGCA
AAAAAGACAA TTTCCATTGA AGAATTTTCA AAAGCTTTAA AACAGGGGCA AAGCGGAGAG
ATTCTTATAA ACACTAAAAA CCTAAAAGAT TTTTTAGATG AAAGTCCACA GAGTTATAAA
GACATTGAAC TTGTGATAAA TTCAGTAATT ACATCCAGGC TTGCTACTCC TGTTGCCAAA
ATGGAGCCGC TTGGGGTCAT AAAAGGAAAA GATTAA
 
Protein sequence
MKKIRDGVYT NDYAIFFMTE EILKDLDEGV LQQAKNASQI PNVEFLGYTP DAHIGKGTSI 
GTIIVWDMSK AWISPTIVGV DIGCGMRLIL TDKFADDIDK ALLKKIMDEV EDLIPTGVGK
KNKKIALSKT KYEEYLQNTE IDKDISDKMV LIHEFDLDTI PDEAHEIGKE QFATLGGGNH
FIEFQKLHVI DKIIAEKWGL FDGQFVVMIH SGSRRFGAVI GDYYQKKFKD VMKSKGITTP
DPQLTFLPID NKVAKDYIKA MQSAAIYAKI NRHYMSNFII SVLEKHSIDA WVLYDVAHNI
AYMERFANRE KLVIRKGATR ALPPNHYLIP NPKFAETGHP VILPGSMGSS SYLMRGIEDN
IISYHTVNHG AGRVLSRTKA KKTISIEEFS KALKQGQSGE ILINTKNLKD FLDESPQSYK
DIELVINSVI TSRLATPVAK MEPLGVIKGK D