Gene Acid345_1756 details


Gene Information

Locus tag: Acid345_1756
Symbol:
ID: 4070636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2127619
End bp: 2129463
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 59%
IMG OID: 637983764
Product: hypothetical protein
Protein accession: YP_590831
Protein GI: 94968783
COG category:
COG ID:
TIGRFAM ID:
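
The start, end, gene length and protein length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive (an assumption, though one consistent with the listed length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the Gene Information table.
# Assumption: both are 1-based and inclusive.
start_bp = 2127619
end_bp = 2129463

# With inclusive coordinates, length = end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets. The last codon is assumed to be the stop
# codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1845 0 614
```

Both computed values agree with the table: 1845 bp and 614 aa.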


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCCA CTGTTCCCGC TCGAAGCCGC TCGCACTTCC ATCGCATTTT TGTTTCGGTT 
TCCGCGTCAG CACTTTTGAT TGGGCTTGCC GGGTGCGGCA GCACTTCAGC TCGCGTGAGC
GATTCATCGG TGGACTTCGG CCAGGTGGCG ATGGGAACGC AAGTGCGAAG GATCGCGGTG
AATGTGACCG CATCCGGCGA CAACGACGTT ACGGTTGCGC CGAAACTCAG CGGGTCGGGA
GATTTCTCCA TCGCTCCGGA CGTGGGATGC GGGACAAAGC TGGCTTCCCA TGGAACGTGC
TCGGTGGTCG TGGTATTCAC ACCAACGAGT ACCGACAGCG CGACGGCGAC CCTGGATTTG
GGGTTGTCGG TGAATGACCA ACAGATATCG CTCACCGGCA GCGGCGTACA ACTCACACCG
GGACAGAGCC TGGTCACTGC GACCGACAGT CCTCTGGTAG CGCTCTATAC GTATGCGCCG
CAGGGTTCGG GTGACGTGCA CATCGAGTTC GGTCCGGACA CCAATTACGG GCTGGAGACT
TCGGCGGTGA CGCCCAGCGC AGGCTCTCCC GTGACGATCT ATGTCGCCGG GATGCGTGCG
AACTCGACGT ATCACATGCG CGCTCGGACG AGCGGCGGCG AAACGGCGAT GGACCAGACA
TTCACGACCA CGAAGATCAC CAGCGGTACG CTGCCGCCGA TCACCGCAAC GAGCAGTGGA
ACACCGCAGT CAGGGATCGA GTTCCTGAAT CCGGGATTCA GCACGGCGAG CACTAACTTT
ATCGAGGCGT ACGCGGTTGA TCTGGAGGGC AACGTCGTTT GGGCGTATGA CTATCCCGAA
CGTACGACCA ATGCACTGCT GCAATCCTTC CACGTGTTGC CGGACGGCAA CATTCTCGCG
CTGATTGGGA CGAGTTCGGC TGTGGCACCA AAATCGGGTG ATCCCATCCT GCTCCGCGAA
TTCAATCTTG GCGGCGTGCC GGTACGCGAT GTGACGCTGG ATCAAATCAA CGCGCAGCTG
TCGGGAATCA GCCTGGTGGA TTTGCATCAC GCGGTGACCG TGCTACCAAA CGGACACTGG
CTGGCGCTCG CCAATGCCTA CAAAAGCTTC GACGGCTTGC CCGGGCAGGG CGATGGAACG
AAGGTTCTCG GCGACGTGAT TGTGGACGTG GACCCGAGCG GCAAGGTCGT GTGGACGTGG
AGCGAGTTCG ATACGCTCGA CGTGAAACGC GCGCCCGATG GTTATCCGGA TTGGACGCAC
AGCAACGCAA TTGTTTATGA GCCGGATGAC GGCAATCTTC TGGTGTCGAT TCGCCACCAG
AATTGGGTTG TGAAGGTGGA TTATCGGAAC GGTGCGGGCA CGGGAAAAAT TCTGTGGCGG
CTTGGATATC AGGGCGACTT CAAGCTGGTT AATGGGACCG AGCCGCAGGA TTGGTTTTTC
GGGTCGCACC AGCCGCAGTT TGTGAGTTCG TCCACGGCGG CCATGTTCGA CCTCACCTTG
ATGGACAACG GCTACAATCG GCAGCTCACG CCGGGGGCCG CATGCACTGG TACAGGCTGC
TACACGGCAA TCCCGATCTA TCACGTGGAC GAAGCTGCTA AGACGGCGAC GATCTTGTGG
CGCGACGCGA TCGATCCATC GAAGTTCTCT GTGTGGGGCG GCGGAACAAC CGTACTCGAT
AATGGAAATC TTGAGTTCGA TCTCTGCGCG TTGGGCAACG ACTCCGAGGT AGACGAGGTC
ACCGTGACCG GTACGCCACA GACGGTGTGG ACATTGAAAG TCACCGGACA GAATCTCTAC
CGGGCGAACC GGATGCCGAG CTTGTATCCG GGGGTGCAGT GGTAG
 
Protein sequence
MKATVPARSR SHFHRIFVSV SASALLIGLA GCGSTSARVS DSSVDFGQVA MGTQVRRIAV 
NVTASGDNDV TVAPKLSGSG DFSIAPDVGC GTKLASHGTC SVVVVFTPTS TDSATATLDL
GLSVNDQQIS LTGSGVQLTP GQSLVTATDS PLVALYTYAP QGSGDVHIEF GPDTNYGLET
SAVTPSAGSP VTIYVAGMRA NSTYHMRART SGGETAMDQT FTTTKITSGT LPPITATSSG
TPQSGIEFLN PGFSTASTNF IEAYAVDLEG NVVWAYDYPE RTTNALLQSF HVLPDGNILA
LIGTSSAVAP KSGDPILLRE FNLGGVPVRD VTLDQINAQL SGISLVDLHH AVTVLPNGHW
LALANAYKSF DGLPGQGDGT KVLGDVIVDV DPSGKVVWTW SEFDTLDVKR APDGYPDWTH
SNAIVYEPDD GNLLVSIRHQ NWVVKVDYRN GAGTGKILWR LGYQGDFKLV NGTEPQDWFF
GSHQPQFVSS STAAMFDLTL MDNGYNRQLT PGAACTGTGC YTAIPIYHVD EAAKTATILW
RDAIDPSKFS VWGGGTTVLD NGNLEFDLCA LGNDSEVDEV TVTGTPQTVW TLKVTGQNLY
RANRMPSLYP GVQW
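
As a rough consistency check between the two sequences above, the sketch below translates the first 30 bases and the last line of the gene sequence and compares them with the start and end of the protein sequence. It builds the codon table by hand in the conventional TCAG ordering. The amino-acid assignments used here are those of the standard code, which translation table 11 shares (table 11 differs in which codons may act as start codons; this gene begins with ATG, so that difference is not exercised). The `translate` function and the constant names are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in TCAG order (third base varies fastest).
# Assumption: amino-acid assignments of the standard code, which
# translation table 11 also uses for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence, copied verbatim.
# The final line starts in frame because 1800 bases precede it.
first_block = "ATGAAAGCCA CTGTTCCCGC TCGAAGCCGC"
last_line = "CGGGCGAACC GGATGCCGAG CTTGTATCCG GGGGTGCAGT GGTAG"

print(translate(first_block))  # MKATVPARSR
print(translate(last_line))    # RANRMPSLYPGVQW
```

The two outputs match the first ten and last fourteen residues of the protein sequence, and the gene ends with the stop codon TAG.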