Gene Athe_1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1819 
Symbol 
ID7408606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1892243 
End bp1893409 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content34% 
IMG OID643716196 
Producthypothetical protein 
Protein accessionYP_002573685 
Protein GI222529803 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000132371 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGATA TAATTTGTTT TTCAACAACA CCATGGGATC CTATACCAAC ACGTAAACAA 
CAGATAATGA AAAGAATGCC ACAAAACTGT AGAATATTTT ATTTAGACCC ACCTGTGACC
TTGATAGGTC CATTAAAAGA CCCTAGTTTG AGACCTTACC TAACAAGATT TAGAAAGTCT
CCAAAAAGAA TAAAGGAGAA CCTTTTTGTA TTTGCTCTGC CACCAATTAT TCCCTTTTAT
AACAAGAAAA GGTCCATCAA TAAGTTCAAT CAAAAAATGA TAGCAAATTT TGTAAAAGAA
GTTATCTATC AAAACTTTGA TTTAAAGTCA CCTATAATAT GGACCTACAT GCCAAACACT
GTTGATCTTC TTGAACATCT TTCTTACAGT TTTTTAGTTT ACGACTGTAT AGACAAACAT
TCAGAGTTTC AAGGGTTTAT TGACAAGGCT TTGGTTGAAA GCATGGAAGA TGAGCTTGCT
CAAAAGAGTA ATGTAGTTTT TACAACAACC CATGGATTAT ATAATAAGCT CAAGTTATTA
AATCCTCACA CATATCTTGT GCCAAACGGT GCTGAGTTTG AACACTTTAA TAAAGCTTCA
AATAAACTGC CTGTACCCGA TAAGATGAAT AATATACCCC GTCCTATCTT TGGCTTTGTG
GGTGTTATCC ACACATGGAT AGACACTCAG CTTATAGAAT ATTTAGCAAA AGAAAAAAGA
GAGTGGTCTT TTGTTTTGAT AGGACCTGTG GGTGCTGGTG TAAGCGTGGA TAATTTAAAG
AAGCTGAGCA ATATTTATTT GCTTGGAAGG ATTGATAACA AGGATTTGCC GCAGTATGTA
TCTCAATTTG ATGTTTGCTT AAACTTATTC AGAACAAACA AGCTCTCAGA GAATGTAAGC
CCGCTAAAAT TTTATGAATA TTTGGCAACA GGAAAACCAA TTGTTTCAAC TTCAATGCCC
CAAGTAGAAC AATTTTCCGA TGTTGTGTAT ATTGGCAAAA ACTATGAAGA TATGCTTGTA
AAATGCATTC AAGCTCTGCA GGAGGCACAA AATCCTAATA TTGAAAAGAT AGAAAAAAGA
ATAGAGTATG CAAAGCAAAC CTCATGGGAT AGCAGAGTAA CTCAAATTAT TGATATACTA
AAGAGGGAAG GGATAAACAT TGAATAG
 
Protein sequence
MIDIICFSTT PWDPIPTRKQ QIMKRMPQNC RIFYLDPPVT LIGPLKDPSL RPYLTRFRKS 
PKRIKENLFV FALPPIIPFY NKKRSINKFN QKMIANFVKE VIYQNFDLKS PIIWTYMPNT
VDLLEHLSYS FLVYDCIDKH SEFQGFIDKA LVESMEDELA QKSNVVFTTT HGLYNKLKLL
NPHTYLVPNG AEFEHFNKAS NKLPVPDKMN NIPRPIFGFV GVIHTWIDTQ LIEYLAKEKR
EWSFVLIGPV GAGVSVDNLK KLSNIYLLGR IDNKDLPQYV SQFDVCLNLF RTNKLSENVS
PLKFYEYLAT GKPIVSTSMP QVEQFSDVVY IGKNYEDMLV KCIQALQEAQ NPNIEKIEKR
IEYAKQTSWD SRVTQIIDIL KREGINIE