Gene Acid345_0833 details


Gene Information

Locus tag: Acid345_0833
Symbol:
ID: 4072359
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1033911
End bp: 1035656
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 55%
IMG OID: 637982842
Product: hypothetical protein
Protein accession: YP_589912
Protein GI: 94967864
COG category:
COG ID:
TIGRFAM ID:
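
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1033911, 1035656)
print(length_bp)                  # 1746
print(protein_length(length_bp))  # 581
```

Both results agree with the Gene Length and Protein Length fields of the record.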


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTACCATC TGAGAAAGCT GGTGCAGCGC GCCAGACGAG GCGTGGCCTG CGCGCTATTG 
GCTTTGGCGG CTTGTACTAC CGTTAGTGTG GCAGCTTTCC CGCAAAGTCA GCCGACCATC
GGTTCGACGT TTGTACCGCT CGATAGTTGG GTTTATCCGG CGTTAGATCG CCTGCGTGCC
CTTGGCTACA CGCACACGCA GTTCATGGGT CTCAGGCCTT GGACACGCCT GATGTGCGCG
CGACTCGTTC ACGAAGCTGC ACAGTCTTTG CGGCCCGGTG ACCTCAACGC GACCAAGATG
TATAACCAGC TCGCAGATGA GTTCCAGCCG GAACTAGGGT ATCTAGGTGG GGACCCGAGC
CAGACGGTCA CAATCGATTC GGTCTACGGT CGAGTTATGG GCATCGGTGG TGACGAGCCG
TTGCGCGACA GTTGGCACTT CGGCCAAACC ATCGCGAATG ATTTTGGCAG GCCATTCGGT
CAAGGCGTAA ACGCGGTCGT TGGGATGAGC GCGCGCGCAC AGCGGGGAAG ATTCTTCATC
GCGTTTCGCG GTGAATATCA GCATGCTCCA GCTTTGCCGG GCTTCGATTC GAATGTACAA
AGCGTTATCG CGAAAATTGA TCAAACGAGC AGTGCTCCTC TCCTGTTTCC GCAAGATCAG
CGAGACCGTT TCAATCTGCT GGACACATAC GCCGGTGTGG CGTTCGGAGC CTTTGAGCTA
ACTTTCGGCA AGCAAAGCCT TTGGTACGGT CCCGGAACCA GCGGGGCATT GCTGTTCAGC
AACAATATTG ACCCTCCGTA TATGTTGAGG CTTGATCAAG TGAATCCGGT TCGGCTGCCC
TCGTTTCTTA AGTATTTGGG GAATATCCGC ACCGAGTTTT TCTTTGGAAA GTTGTCGGGA
CATTCTTTCC CGGCACGACC GTTCATGCAT GGAGAAAAGG TCACTCTCAA GCCGACTGAC
AATCTTGAGG TCGGCTTTAC GCGCATGACC GTATTCCTGG GCGAAGGCAA TGGGTTTACT
CTCGGGCGCA TCATCCATAG TTATTTCAGT GTCGGAGACA ATCTCGGAAG CAATCGTTCG
AACAGCGATC CGGGCGACCG TAAAGGCGGA CTCGATGCAA GCTACCGCGT GCCGGGTTTG
CGCGATTGGG TCACGATTTA TACAGATTCC TTTACGGATG ATGACCCTTT ACCGCTTTCT
GCTCCGCACC GCGCGGCCTG GAACCCCGGT ATTTACATGC CGAAGCTTCC AGGATTGCCG
AGTCTGGATC TCCGTGTGGA AGGGGTAACC ACGGATATCC ACTCCGAAGC GACGGTTGGT
CACTTCGTTT ACTACAACGG CATATACAAG GACGGATATA CCCAAAACGG CTTTATCATT
GGAAATACAA TCGGACGAGG CGGGCGTGCC ATACAGGCGA CCAGTACCTA CTGGTTTAAC
GCGCGCAACG ACATCCAGGT GGGCTTCAAG ACGGGAACGG TGGATTACAG GTATATCCCG
GGCGGCGGCG GCCAGAAGGA TTACAACGTT CGCGCCGACT GGTTAGTGAA GAAAAACATC
GCCCTGTCTG GATTTGTTCA GTACGAGCAC TGGAGTTTCC CGCTGCTGGC GGCGACCCCG
CAAAACAACG TAGCAGCGTG GTTGTCCATC ACCATCGATC CGAAATTGGA ATGGGGTCAC
GCCCGCACTG CGCTCCATCG TGATTCCACG AGTCGTCCGT CTACTCAGGA TTTAAAGCAG
GAGTAA
 
Protein sequence
MYHLRKLVQR ARRGVACALL ALAACTTVSV AAFPQSQPTI GSTFVPLDSW VYPALDRLRA 
LGYTHTQFMG LRPWTRLMCA RLVHEAAQSL RPGDLNATKM YNQLADEFQP ELGYLGGDPS
QTVTIDSVYG RVMGIGGDEP LRDSWHFGQT IANDFGRPFG QGVNAVVGMS ARAQRGRFFI
AFRGEYQHAP ALPGFDSNVQ SVIAKIDQTS SAPLLFPQDQ RDRFNLLDTY AGVAFGAFEL
TFGKQSLWYG PGTSGALLFS NNIDPPYMLR LDQVNPVRLP SFLKYLGNIR TEFFFGKLSG
HSFPARPFMH GEKVTLKPTD NLEVGFTRMT VFLGEGNGFT LGRIIHSYFS VGDNLGSNRS
NSDPGDRKGG LDASYRVPGL RDWVTIYTDS FTDDDPLPLS APHRAAWNPG IYMPKLPGLP
SLDLRVEGVT TDIHSEATVG HFVYYNGIYK DGYTQNGFII GNTIGRGGRA IQATSTYWFN
ARNDIQVGFK TGTVDYRYIP GGGGQKDYNV RADWLVKKNI ALSGFVQYEH WSFPLLAATP
QNNVAAWLSI TIDPKLEWGH ARTALHRDST SRPSTQDLKQ E
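
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes NCBI translation table 11 (the table named in the record), in which amino acid assignments match the standard code and alternative start codons such as TTG are read as methionine at the first position. The helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the code is an illustration rather than the method used to produce this record.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11 (assumed from the NCBI table).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        codon = s[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the gene sequence above, followed by its stop codon.
print(translate("TTGTACCATC TGAGAAAG TAA"))  # MYHLRK
```

The output matches the first six residues of the protein sequence above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.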