Gene Acid345_3282 details


Gene Information

Locus tag: Acid345_3282
Symbol:
ID: 4072694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3886455
End bp: 3888023
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 61%
IMG OID: 637985303
Product: hypothetical protein
Protein accession: YP_592357
Protein GI: 94970309
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
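
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the gene is a stop codon that is not counted in the protein length; neither convention is stated in the record, but both are consistent with the listed values. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the last codon is an uncounted stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3886455, 3888023)
print(length)                     # 1569
print(protein_length_aa(length))  # 522
```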


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.172995
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATCG TCACCGCACA GGAAATGCGC GACATCGATC GCATTACGAC CGAGCGCTAC 
GGCGTGCCGT CGCTGACGCT GATGGAGAAC GCCGGGCGCG CGGCAGCAGA GATGGTTGTG
GAGAATTACC CCGAGGCCAG GTCCATTGCC GTGGTGTGCG GGAAGGGAAA TAACGGCGGT
GATGGCTTCG TAGCGGCGCG GCATCTGCAT AAGATGGACC GCGGGGTCGA GGTGCTGCTG
CTTGCTGATC CAGAGGGCTT GCGTGGCGAT GCAGCGGAGA TGTATCGGCA GCTTGGGTTC
GCTGCGACGA TCGTGAAATC GGAAGAGACG ATCTCATCCA ATTTGCAGCG CGCGTTTGCA
GAAGCCGACG TGATTCTCGA TGCGGTGCTC GGCACGGGAT TCAAGCCACC AGTCTCGCCG
TTGTACGCGA AGGCAATCGC CGCGATGAAC GCGAGCAAAT TGCCGATCGT CGCTGTGGAT
GTGCCATCTG GAGCGGACTC CGACGGTATG CAGCCGCAGT CGGGCGAGGC GATTGCGCGC
GCCGATGCAG CGGTAACTTT CACCGCGCCC AAACCGGTTC ACGTGTTCGG CGATCTGGTT
CGCGGAAAGA CTGTGGTTGC ACCGATTGGT TCTCCCGACG AAGCCATTGT CAGCAATCTG
GGTCTGAACG TAATCACGCC GGCTGACTAT GCGGCCGTGC TGGCCGCTCG GCCGCTCAAC
AGCAACAAGG GAATGTACGG CCACGCGCTG ATCGTGGCGG GATCGTTTGG AAAATCCGGC
GCAGCGGCGA TGGCGGGTAT GGCGTGTCTG CGCGCTGGTG CCGGGCTCGC GACCGTGGCA
ACGCCGAAGT CGGTGCTCAC CAGCGTGGCG TCCTACGCGC CGGAGTTGAT GACTGAGTCG
CTCGCCGAGA CCGCGGACGG CACGATCTGC GAAGCCGCGA TTTGGGCCAT TCAGGAACTC
GCGAAGAAGA TGACAGTACT TGCGATTGGA CCCGGGCTGA CGCAGAACGC TGAGACCATC
CAGGTCGTAC GAGAGCTCGT GCGAGCCAGC GAAAAGCCCA TGGTGATTGA TGCCGACGGG
CTGAATGCGC TCGTCGATCA AACCGAGGTT CTGAAAGATG CGAAGGCAGC CACGATCATC
ACCCCGCACC CCGGTGAGAT GTCGCGGTTG TGTGGGATAA GTACGAAAGA GGTCCAAGCC
GACCGCGTAG GAATCGCAAA GAACTTCGCT GCATCTCGTT ATACGATCGT TGTGCTCAAG
GGAGATAAGA CCGTCATCGC CGCGCCTTCG GGAGAAACGT GGATCAACTG CACCGGCAAT
CCCGGCATGG CAACCGGAGG CACTGGCGAC GTGCTTACCG GTATCCTCAC CGGCCTGCTG
GCGCAACATC CGCAGGATCC GCTGCTGTGC GCGATTGCGG CGGTACATCT CCACGGGATG
GCCGGCGACC TTGGCCGCGA TAGGGTTGGT GAGATTTCCC TGATTGCCAC CGACTTGATC
CATGCCCTGT CTGGGGCATT CGAGCGCGCG AAAAAGAGTT TGCAGAAGCC CTGGGTTCCC
CTAAATTAG
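
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how a GC fraction such as the one in the Gene Information table could be recomputed; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGAAGATCG TCACCGCACA GGAAATGCGC GACATCGATC GCATTACGAC CGAGCGCTAC"
seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_content(seq):.0%}")    # GC percentage of this line only
```

Passing the full gene sequence instead of `first_line` gives the value to compare against the reported GC content.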
 
Protein sequence
MKIVTAQEMR DIDRITTERY GVPSLTLMEN AGRAAAEMVV ENYPEARSIA VVCGKGNNGG 
DGFVAARHLH KMDRGVEVLL LADPEGLRGD AAEMYRQLGF AATIVKSEET ISSNLQRAFA
EADVILDAVL GTGFKPPVSP LYAKAIAAMN ASKLPIVAVD VPSGADSDGM QPQSGEAIAR
ADAAVTFTAP KPVHVFGDLV RGKTVVAPIG SPDEAIVSNL GLNVITPADY AAVLAARPLN
SNKGMYGHAL IVAGSFGKSG AAAMAGMACL RAGAGLATVA TPKSVLTSVA SYAPELMTES
LAETADGTIC EAAIWAIQEL AKKMTVLAIG PGLTQNAETI QVVRELVRAS EKPMVIDADG
LNALVDQTEV LKDAKAATII TPHPGEMSRL CGISTKEVQA DRVGIAKNFA ASRYTIVVLK
GDKTVIAAPS GETWINCTGN PGMATGGTGD VLTGILTGLL AQHPQDPLLC AIAAVHLHGM
AGDLGRDRVG EISLIATDLI HALSGAFERA KKSLQKPWVP LN
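
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a sketch only: it builds the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its set of permitted start codons, which does not matter here because the gene starts with ATG). The `translate` function is a hypothetical helper, not part of any library.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line and last line of the gene sequence above.
print(translate("ATGAAGATCG TCACCGCACA GGAAATGCGC GACATCGATC GCATTACGAC CGAGCGCTAC"))
# MKIVTAQEMRDIDRITTERY
print(translate("CTAAATTAG"))
# LN
```

Both outputs match the start and end of the listed protein sequence.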