Gene Acid345_2175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2175 
Symbol 
ID4073117 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2594941 
End bp2596806 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content57% 
IMG OID637984191 
Producthypothetical protein 
Protein accessionYP_591250 
Protein GI94969202 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.171043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGACC CAGATACCAC TCCCACGCTC GTCGAGCCGA CTACGAATGT GGATGATCCT 
GCAGACAACC AACAAGTAGA AGCAACGAAG GAAACTTCGA AGGTCGCACA AAAGCGGAGT
CCCACGAGCA GCGACACGTG GATCGGCTAT GTATTTTGGC TGATCATCAT CTTTTGGATT
CTGGTGGTGA AAGTTCCACC TATGCACGCG TTTTTTTGGC GGTATTGGTG GGAGTCGCTC
ATAACCCTGG TGTTGTTGAT CGCGGTGATT GGATCGTACG CGCCGTGGCG GCAGAAGATT
CGGAGCAGCG TACACACACG GGTTGGCTCG GTGATCGTGG CGCTCGCGGG TATCTCGATT
CTTGAGGGCG CTATCAGCGT TCTTCCACAG CGATACCAGA TCCTGGCGTT GCGCTCCACC
TTCCTGCTGG TGGTGTGCCT GTTTCCGGTG ATCCTGTATT ACCTGTTCAT CACAACGCGA
AAGTACAGCT TACTGAATGA ATTCATCACC AACCTGGAAC GTCTCGGATT GCTGCGCGAA
CGAGTGACGA CTCCGCGTTC GGCTTCATCT GCCGCGAAGG AGATCGCCCC GGACTCAGCG
TTGAATTTGC GTATTCTGAC ATATTTGCAA AAATTCGAGG CGGTTTACGG AAGCCTGGAT
CCAGAACTCG TGAGCGAACT CGTGGCTGCT ACAGACCCAG CGGCGGTGTT TGCGAAGCCG
GAGTGGAACC GACGCGCCAC TGCGGGATTC TCCAGCATCT TTACTCCTGA AACTACGGCT
CCACTGGTTA CAGCTACCGG GCTGATTGCG CTCGGGTGGA TCCTGGCGTT ACCGCCATGG
CAGGCGGCTG CGCCCCAGCC GCTTGCGGCT CAAGCCAAGA GCGCCAACAC GCAGGTGCTT
TCGGCAGCGA CGCCGGGATT GCCGACGAGC GCAACTGGAA GTACGGAGAC TCCGCTGCCC
GCGAAGGCCG CAGATGCCGC GCCGTCGACG GTGGCACAGA CGAACACAGT GTCGTGGCTG
CGGGCTTTGT ATCCGGACGA ATCGCCGGTG TATTTCGCGT TCCTGGGCGC ATATTTCTTC
TCGATTCAAA TGCTGTTCCG CCGATATGTG TTGAAAGACT TGCGCTCGAG CGCCTATGTA
GCAGTGTCGC TGCGGATTGT GATGGCGGTA ATGGGGACGT GGGTGATGAT TCCGACCGCG
CGCACGTTGC ACATCCTGGA TAGCTCCTCA GACCTGAACA CCAATTCCAA GCTGCTGGTG
CTGAGCTTTG TGATCGGAGT GTTTCCGCCG GTGATCTGGC AATTCCTGCA GGCGGCGTTC
AAGAAGATCA GCGGGGCACG ATATTTCTTG CCGAGCTTGA GTTCGGAGTT GCCGCTGAGC
AGCCTTGATG GACTGACCGT GTGGCACGAG GCGAGACTCG AAGAAGAAGA CATTGAGAAT
GTGCCGAACA TGGCAACGGC GAGCATTGTG GACCTGATGC TGTACACGCG CTTTTCGCCG
GACCGGATCA TTGACTGGAT CGACCAGTCG ATTCTTTACA CCCAGCTCGG ACCTGACCGA
AAGATCGGAA ACAGTGAGGT TACGGTGCGG GCGAAGTTGC GGTCACATGG AATCCGCACA
GCAACGGCAT TGCTGGAAGC GTATCGGAAA GCGACAGACC CAGAAGATAA GAGGGGGTTC
GAGGCGATTC TCGAAAGCGA CGGACGGCCA CCCATCCGTA CGCTGACAGA TGCTCTGCTA
ACAAGTCCGA ACCTGGATCT GGTACGAAAC TGGCGCGCTC TGGAGCCATT TGCGGGAGGC
GAGTGGGTGT CGCATACGGC TCCGGTACAC CATAATCGCG CGTTGGCGGT GCAAGCGAGA
GGGTAG
 
Protein sequence
MPDPDTTPTL VEPTTNVDDP ADNQQVEATK ETSKVAQKRS PTSSDTWIGY VFWLIIIFWI 
LVVKVPPMHA FFWRYWWESL ITLVLLIAVI GSYAPWRQKI RSSVHTRVGS VIVALAGISI
LEGAISVLPQ RYQILALRST FLLVVCLFPV ILYYLFITTR KYSLLNEFIT NLERLGLLRE
RVTTPRSASS AAKEIAPDSA LNLRILTYLQ KFEAVYGSLD PELVSELVAA TDPAAVFAKP
EWNRRATAGF SSIFTPETTA PLVTATGLIA LGWILALPPW QAAAPQPLAA QAKSANTQVL
SAATPGLPTS ATGSTETPLP AKAADAAPST VAQTNTVSWL RALYPDESPV YFAFLGAYFF
SIQMLFRRYV LKDLRSSAYV AVSLRIVMAV MGTWVMIPTA RTLHILDSSS DLNTNSKLLV
LSFVIGVFPP VIWQFLQAAF KKISGARYFL PSLSSELPLS SLDGLTVWHE ARLEEEDIEN
VPNMATASIV DLMLYTRFSP DRIIDWIDQS ILYTQLGPDR KIGNSEVTVR AKLRSHGIRT
ATALLEAYRK ATDPEDKRGF EAILESDGRP PIRTLTDALL TSPNLDLVRN WRALEPFAGG
EWVSHTAPVH HNRALAVQAR G