Gene Acid345_4089 details

Gene Information

Locus tag: Acid345_4089
Symbol:
ID: 4072511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4847537
End bp: 4849645
Gene Length: 2109 bp
Protein Length: 702 aa
Translation table: 11
GC content: 64%
IMG OID: 637986120
Product: hypothetical protein
Protein accession: YP_593163
Protein GI: 94971115
COG category:
COG ID:
TIGRFAM ID:
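
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the function names are hypothetical and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and does not encode an amino acid.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4847537, 4849645)
print(length)                     # 2109
print(protein_length_aa(length))  # 702
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields of this record.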


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.331742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.947235
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATCGCA TCGTCGGTGT GCTTTGCGCA GCAATGTTGA TGGGTGGAAC CGCGCTCGCC 
CAAACGAACG AAGCCGCAGC CACGGTTGCT CAGCCAACCG TTCCTCGCCT CATCCGCTTC
GCCGGACAAA TCGCGCCGGA AGCCGCGCCA TCCTCCACCG TCGCGCGCGG AATCACCTTC
AGCCTCTATC GTTCCGAGAA CGACACATCT GCGCTTTGGA CTGAAACCCA AAACGTCTCG
CTCGAAAAAG ACGGCAAATA CACTGTCCTT CTCGGCGCCA CGAAATCCGA AGGTCTTCCC
GCGGACATCT TCACTTCTGG CGAAGCGCAA TGGCTCGGTA TCCGCGTCGA AGGTTCTCAC
GAGAAACGCG TCCTCCTCGT CAGCGTTCCC TACGCGCTTC GCGCCGCCCA GGCCGACACT
CTCGCTGGAC ATCCCGCCCT CGACTTCGTC ACCACCGACA AACTCACCAC CGTCGTCAAA
GAACAAATCG CGCAGCAGAC CACGACCCTC GGCAAAGGCC GCACTGCTGC TGGCGCAATT
AAGAACGCAG TTTCCGCCAC CCCGACGAAC TTCACCGGGA GCACCACCGA CCAGATCGTC
GGCGTCACGC AAAGCAGCAC CGGCATGGGA ATCAATGTTT CCAGCGGCAC TGGCTACGGC
GTGTACTCGA AGAGCACCGG CAGCGCTCTC TACGGCGTCA GCACCTCAAC GTCGCTCGCC
GCCTACGGTG TCTTCGGATC TTCAGCCTCT GCCGCCGGCT ACGGCGTCTT CGGCTCCAAC
ACTTCGCCGA CCGGCACCGC CGTCGGCATC CGCGGTACCT CGAGCTCGCC CGGTGGCATC
GCGGTGTACG GCACCGCAAA CACGGCCACC GGCACGGCAA CCGGCGTGAA AGGAATTACC
CAATCGCCAG ACGGTTACGG CGTCTTCGGC CAGAACACTG CCACCACCGG CGTAGCCATC
GGCTTCCGCG GATCAACCGC ATCCACGGCG GGCGTCGCGA TCTACGGCAC CGCCACCGCC
ACCACCGGCG CCACCACTGG CATGCGCGCC ACGGTCGCCA GCGCTAACGG CGTCGCCGCT
CTCTTCCTCA ATAGCGCGCA CGGGAAGCTC CTCAGCGGCA TCGTCGGAAC GAGCACCGAA
GTGTTCAGCG TGGATGGCGC CGGCAACATC GTCGGGGGAT CGCTCAACGC ATCATTCGTG
CAAGGTGCCT CAACTACCCT TGGCATTTTC GGCGGATTAT TTAGCGGCGC CAATGGCGGC
GGCACAGCCG TCTCCGCCAA CGGTGGCGCG GCCGCTGCCT CCGCAACCAC GGCCGGCGCG
GGACTCGTCG CGACCGGCGG TACGCTCGCC AGCGGCGTGA CCAACGCCAC CGGCGGCGAC
GGCGCCGATT TCTACGGTGG TGATGGCGAT TCCGGCACCG GCAACGGCGT CAGCGGAACC
GGCGGCAACG TCGCGAATGC CGCGGCGATC ACCGGCGGAT ACGGCGGATA CTTCATCGGC
GGCGGCCCCA ATGGCGACGG CCTCTACGCC GCGCCCTCCA TAGGCGGCAC CGGCAACGCC
GCTACGCTCG ACGGCAACGT TACTGTTACC GGCCTGCTCA CCACATCGTC CGCCGCGCGC
ATGCAAATCG ACGATCCGCT CGATCCCGAA AGCAAAATCC TCGAGCACAG CGGCATCCAA
TCGTCAGAAT TGTTGAACGT ATACTCCGGC AACGCGACCA TCGGCGCCAA TGGCCGCGCT
GCCATCAAGC TGCCCGCGTG GTTCGAGGCT GTTAACACTG ATTTCCGCTA TCAACTCACG
CCCATCGGCG CGCCCGCTTC GCTCTACGTC AGCGCACCGA TCGCAAAAGG CGCGTTTGAA
ATTGCAGGTG GCACGCCGGG CATGACCGTC TCCTGGCAGA TCACGGCGGT CCGCAAAGAT
CCGTATCATC TCGCGCACCC GCTACAAGTC GAAACCGACA AAACCGCGGC GCAGCGCGGC
CTCTACCTTC ATCCGGAAGC CTACGGTGCC TCGCCCGACA AACGCCTCGG CGCTGTTCAA
CATTTACCGA CCAATCGCAA ACCGCGCGCG GCGAAAGGCG CTGTTTCAAA AGCTCCGCAG
GACAACTAG
 
Protein sequence
MNRIVGVLCA AMLMGGTALA QTNEAAATVA QPTVPRLIRF AGQIAPEAAP SSTVARGITF 
SLYRSENDTS ALWTETQNVS LEKDGKYTVL LGATKSEGLP ADIFTSGEAQ WLGIRVEGSH
EKRVLLVSVP YALRAAQADT LAGHPALDFV TTDKLTTVVK EQIAQQTTTL GKGRTAAGAI
KNAVSATPTN FTGSTTDQIV GVTQSSTGMG INVSSGTGYG VYSKSTGSAL YGVSTSTSLA
AYGVFGSSAS AAGYGVFGSN TSPTGTAVGI RGTSSSPGGI AVYGTANTAT GTATGVKGIT
QSPDGYGVFG QNTATTGVAI GFRGSTASTA GVAIYGTATA TTGATTGMRA TVASANGVAA
LFLNSAHGKL LSGIVGTSTE VFSVDGAGNI VGGSLNASFV QGASTTLGIF GGLFSGANGG
GTAVSANGGA AAASATTAGA GLVATGGTLA SGVTNATGGD GADFYGGDGD SGTGNGVSGT
GGNVANAAAI TGGYGGYFIG GGPNGDGLYA APSIGGTGNA ATLDGNVTVT GLLTTSSAAR
MQIDDPLDPE SKILEHSGIQ SSELLNVYSG NATIGANGRA AIKLPAWFEA VNTDFRYQLT
PIGAPASLYV SAPIAKGAFE IAGGTPGMTV SWQITAVRKD PYHLAHPLQV ETDKTAAQRG
LYLHPEAYGA SPDKRLGAVQ HLPTNRKPRA AKGAVSKAPQ DN
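
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it builds the codon table in the standard NCBI base order (T, C, A, G), which translation table 11 shares with the standard code for amino acid assignments, and it assumes that an alternative start codon such as GTG is read as methionine at the first position. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        # Assumption: a recognised start codon in first position encodes Met.
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


first_line = "GTGAATCGCA TCGTCGGTGT GCTTTGCGCA GCAATGTTGA TGGGTGGAAC CGCGCTCGCC"
print(translate_cds(first_line))  # MNRIVGVLCAAMLMGGTALA
```

The printed residues match the first 20 residues of the protein sequence above, and the final gene line `GACAACTAG` translates to `DN` followed by a stop codon, matching the end of the protein.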