Gene Acid345_3816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3816 
Symbol 
ID4071100 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4511074 
End bp4513011 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content55% 
IMG OID637985839 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_592890 
Protein GI94970842 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.280981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTATCA ACGAAATTCC GCAGTTTCCG AGGTGGTTTA CAAAAATCCA GCGTACGAGA 
ACTGGACTCA GTCTGCTCCT GCAGGCTGCC ATTAGCGCGG TTGCTCTGCT TTGTGCCTGG
ACCTTGCGAT TCGAATTTGC GTTGCCCAAT CAGCGACTGC TGTGGGTCTC AGTGCCCATC
CTGGTGTTCC TTCGAATTGC GGCGATTTAC CGCTTCAACC TTGATCATGG GTATTGGCGT
TTCAGCGGCA TCTCTGATGC CTTGAATATT GCCAAGGCCG TCACTGTCAG TTCGTTCTGC
TTCATGCTCG TTCTCCGATA CGGATTTCAG CTGACAGCTT TCCCAATCTC TATCTATTTG
CTTGAGCCGC TCTTGTCTGC GTTCGGATTG GGAGCAAGCC GCGTCGCGGT TCGCAGTGTG
TTGGTCAAAC TGGAGGCATC GCAAGGTAGA AAAAGGCATT CTCGCGTCGT CATAGTGGGA
GCAGGGTTTG CCGGGCAGAT GTTGCTGCGC GAACTGCTCA CATCGCGTGC CAGTCACGTC
GCCGTCGCAC TCGTGGATGA TGACTCCTCG AAGCGTGGGG CTCTGGTACA CGGCACGCGT
GTCGAAGGGG CGATCAAGAA TCTCCCGACG ATTGCTGCAA AGCATCGCGC GGACGAGGTT
CTTATCGCGG TGCCATCTGC GACGCGTGAT CAGATGTTTC GAATTGTTGA AGCTTGCCAT
GCGGCGAGAG TGCCATACCG CACGGTGCCG AGTCTCAACG ATCTCGTGGC GGGGAAGGTG
GCGATCAGCG AACTTCGTGA GATCGATCTG GAAGATCTTC TCGGACGTGA ACCGGTGCAT
CTGGAAAACG AGCCGGTTCG CAAGAGTATC GCTGGCCGGG TTGTAATGGT AACTGGAGCC
GCGGGGTCCA TTGGCTCGGA ACTCAGTTCC CAGATACTAT CGTTCGGCCC GGCGATGTTG
ATCTGCGTCG ATCACGATGA AACCGCCCTA TTTAATCTTG AACACCGGCT CGCGGTACAG
GAAACGAGCA GCCGGCTTTT GTATTTTGTA GACGATGTCG GCGATTCTGA GCGGATGCGT
CATCTGCTGC TGCACAACGA GGTGGATTTC ATTTTCCACG CAGCTGCGTA CAAGCACGTT
CCGATGATGG AACGCAATCC TCGAAAGGCT TTGCGCAACA ATGTCTTCGC GCTCCGGCAT
TTCGTCGAGG CAGCAGAAGA GTCTGGAGTC GAGGCGTTCG TTTTGATTTC CTCCGACAAG
GCGGTTAATC CCACGAACGT GATGGGGTGC ACGAAACGTA TCGGTGAGTT GATCTTGTCG
GCAAATCGAA ATCGGCGAAT GCGTTGCGTG AGTGTGCGGT TCGGAAATGT GCTGGGTTCG
CAGGGAAGCG TGGTTCCGAT CTTTCAGCAA CAGTTACGCG AACACAAGCC GCTCACGATC
ACTCATCCCG AGATTACCCG CTTCTTCATG ACGGTCTCAG AGGCGGTATC TCTGGTGCTT
CAGGCGTTTA CAATTGGAAC TCACGGCGAC ATTCTGGTTC TCGATATGGG GCGTCAAATC
TCGATTGTTC GCATGGCGAA AGCGCTAATC CATCTCTCCG GATTCTCCGA GGAAGAGGTG
CCGATCAAAT ACACGGGATT ACGTCCTGGC GAGAAACTGT ACGAAGAGCT GTTTTATGAC
TCGGAAGTCC GGATTGCGAC GGAGCGCTCG AAGGTTCTTC GCACCAAAGG GAAGATTCTC
AGTTGGGCCG AGTTAGATCG GCGGCTCCGA ACTTTGGAGT GGAAGTTAGG TGAGGCAAAC
GAGGATCAAC TCCGACGCCT GATGGCGGAA ATTGTTCCCG AGTACTCAAT CACGCCCAAC
GAACAGCGTC CATTGCCTGC GTCAGTCCCG ATCGCGAGCA GTGTGCACGG CAGGCATGCT
GCCGCGGGAC TGGATTAA
 
Protein sequence
MSINEIPQFP RWFTKIQRTR TGLSLLLQAA ISAVALLCAW TLRFEFALPN QRLLWVSVPI 
LVFLRIAAIY RFNLDHGYWR FSGISDALNI AKAVTVSSFC FMLVLRYGFQ LTAFPISIYL
LEPLLSAFGL GASRVAVRSV LVKLEASQGR KRHSRVVIVG AGFAGQMLLR ELLTSRASHV
AVALVDDDSS KRGALVHGTR VEGAIKNLPT IAAKHRADEV LIAVPSATRD QMFRIVEACH
AARVPYRTVP SLNDLVAGKV AISELREIDL EDLLGREPVH LENEPVRKSI AGRVVMVTGA
AGSIGSELSS QILSFGPAML ICVDHDETAL FNLEHRLAVQ ETSSRLLYFV DDVGDSERMR
HLLLHNEVDF IFHAAAYKHV PMMERNPRKA LRNNVFALRH FVEAAEESGV EAFVLISSDK
AVNPTNVMGC TKRIGELILS ANRNRRMRCV SVRFGNVLGS QGSVVPIFQQ QLREHKPLTI
THPEITRFFM TVSEAVSLVL QAFTIGTHGD ILVLDMGRQI SIVRMAKALI HLSGFSEEEV
PIKYTGLRPG EKLYEELFYD SEVRIATERS KVLRTKGKIL SWAELDRRLR TLEWKLGEAN
EDQLRRLMAE IVPEYSITPN EQRPLPASVP IASSVHGRHA AAGLD