Gene Acid345_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3126 
Symbol 
ID4070240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3717583 
End bp3719484 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content59% 
IMG OID637985145 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_592201 
Protein GI94970153 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.503601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATGA AGCAAAGAGC TTGGGCTGTC GGGCTATTTC AAATCGTCCT CGTCGTCTGT 
TCACTATTCG CGGCCTGGGC CTTACGATTC GACTTCCGAC TGCCACACCT GGAATACGTA
CTCCAGGCAC TGCCGATACT CATCGTCCTG CGGCTGGCTG CATTTGCGCG CTTCAACCTT
TTTCATGGGT ACTGGAGATA TACCGGCGTT AACGACGCCC TCGATATCGC CAAGGCGGTT
TCGACCAGTT CGATCGTCTT TGCCATTGTT ATCCGGTACT TACTCGGAAA TAGCCACTTT
CCAATTTCGG TTTATCTGCT CGAAGCCGCG CTATCGCTTC TGCTTCTGTG CGGCGTTCGG
GTCGCTTCGC GAGCCATGAT GGAATCGGCG ATGCGCGAGG CCCAATCCAC CGGTAAGGGC
GTCGTGATCG TGGGTGCCGG GTTTGCCGGC CAACTCCTCG TCCGTGAGCT ACAGCGTCCC
GAGAGCGGCT TCCGTCCCGT CGCTTTTGTC GATGATGACC CACGTAAGCA GGGAGTGAAG
ATCCAGGGGC TGCCGATCGC CGGAACGGTG GAAGAACTCT CGAGGGTGTT GCGGGAATTC
GGCGCGACTG AAGTGCTGAT TGCCATTCCG AGCGCGAACG CCGCCGAAAT GCGACGGATC
GTCCAGATCT GTTCCAACGC CCGGGTCGGG TTCAAGACTA TCCCGAGCCT GGGAGAACTT
GCGTCCGGCA ACGTCGGCGT TACCGAACTT CGGTCGGTAA ACCTCGAAGA CCTACTCGGT
CGCGAGCCGG TCAAGCAGGA CCTCGAAGCC GTCCGCGACG TGCTGAGCGG CGCGGTGGTG
ATGGTAACGG GCGCTGCCGG ATCCATCGGT TCCGAACTAT GCCGCCAGAT CCAGGGCTAC
GGGCCGTCTC TGCTGATTTG CGTTGACCAG AACGAGACCG GTTTGTTCAA CCTGCAACAG
GAGTTGCTGG ATTTCCCGAA CCCGCATGCG GCGGCGTTTT TCGTAGCCGA CGTCGGCGAC
GCTCCGCGAA TGCGTCATCT CTTCCAGCGC TACCGGGTTG ACTACGTATT CCATGCCGCG
GCATACAAGC ACGTTCCGCT GATGGAAGAC AATCCGCGGG AAGCGATCCA GAACAACGTT
GTCGCCCTGC GAGATTTAAT GCGGATCGCC GATAAAGCCG GTTGCAAGCG GTTCCTGCTG
ATCTCCTCCG ACAAGGCCGT CAATCCGAGC AGCCTGATGG GCTGCACCAA GCGTGTCGGT
GAACTTCTCC TCGGGTCGTG GCCAACCACA GGCATGGATT GCGTGTCGGT GCGCTTCGGC
AACGTGCTGG GATCCCAGGG CAGTGTCATT CCGCTCTTCC AGCAGCAGAT TACGCGCCAC
CGCCGAATCA CGGTGACGCA CAAAGACATC ACGCGCTTCT TCATGACCAT TCCCGAGGCG
GTTGCCCTGG TGCTTCAAGG CTTCACCGTG GGTAGCCATG GCGACATCCT GGTGCTGGAT
ATGGGCGAGG CGATCCGAAT CGTGGACATG GCGAAGGCGC TGATTCGCCT CTCCGGCAAG
TCTGAGGAAG ACGTAGAAAT CGTCTTCACC GGTCTGCGTC CGGGCGAGAA GCTCTACGAA
GAACTGTTCT ATGCCCACGA GTCTGTCGAG CCCACCGACG TTCCGAAGGT GCAGAAGACG
CGTGGCCAGA TGATTGCGAC CGAGAAACTT GCCCATATGA TCGATGAATT GGAAGGGCTG
ATACAGACGG AGCGGGAAGA TGCGGTCCGC GCCAAGATGA AGCAGATCGT TCCGCAATAC
ATGTATGCGC CGGTCCGCGA GTATGCAAAG CCGCCGGTTC GTGCCTTCGA GGTCATGCGC
GGCAAAGACA TGTCATCGCA CAAGGCAGCC TCCGCCGACT AA
 
Protein sequence
MFMKQRAWAV GLFQIVLVVC SLFAAWALRF DFRLPHLEYV LQALPILIVL RLAAFARFNL 
FHGYWRYTGV NDALDIAKAV STSSIVFAIV IRYLLGNSHF PISVYLLEAA LSLLLLCGVR
VASRAMMESA MREAQSTGKG VVIVGAGFAG QLLVRELQRP ESGFRPVAFV DDDPRKQGVK
IQGLPIAGTV EELSRVLREF GATEVLIAIP SANAAEMRRI VQICSNARVG FKTIPSLGEL
ASGNVGVTEL RSVNLEDLLG REPVKQDLEA VRDVLSGAVV MVTGAAGSIG SELCRQIQGY
GPSLLICVDQ NETGLFNLQQ ELLDFPNPHA AAFFVADVGD APRMRHLFQR YRVDYVFHAA
AYKHVPLMED NPREAIQNNV VALRDLMRIA DKAGCKRFLL ISSDKAVNPS SLMGCTKRVG
ELLLGSWPTT GMDCVSVRFG NVLGSQGSVI PLFQQQITRH RRITVTHKDI TRFFMTIPEA
VALVLQGFTV GSHGDILVLD MGEAIRIVDM AKALIRLSGK SEEDVEIVFT GLRPGEKLYE
ELFYAHESVE PTDVPKVQKT RGQMIATEKL AHMIDELEGL IQTEREDAVR AKMKQIVPQY
MYAPVREYAK PPVRAFEVMR GKDMSSHKAA SAD