Gene Acid345_0665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0665 
Symbol 
ID4069757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp823007 
End bp824209 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content57% 
IMG OID637982671 
Productsugar isomerase (SIS) 
Protein accessionYP_589744 
Protein GI94967696 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2222] Predicted phosphosugar isomerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCA AAACAGAGAA AAATTCGGGC AATCCGTTGT CGGCCTTGAT GGAGTTGGCG 
CAGGAAGAAA AAGCCTCCAG GGGCCTGCTG CATACTCCCG CCGAGATCGC GCAGCAGCCC
GAGACATGGC AAGGCACCTA CGCCCGCGTC GATCGCCAGA AGAGCGCGTT GCGCGAATTT
CTCGGCGTGA GCTTGAGCGC CACAACCACC GTTTACCTCA TCGGCGCCGG GACTTCCGAC
TATATCGGCC GTGCGTTGTG CAGCGTTCTC CGACAGAAAT GGCAATGTGA CGTAATCGCG
GTTCCGAGCA CCGAACTCAT CACCAACCTT GAGAACTATG TTCTGCCACG CAGAAACTAC
CTCTGGATTT CCTTTTCACG CTCTGGCGAC AGCTCCGAAG GCGTCGGTGT GCTCGACCTC
GCGATTACGA AATATCCCGC AATCCGGCAC CTCATCGTGG GCTGCAACAA AGACGGCAAG
ATGGCGCAGA TGTGCGCGGA CCGCGACAAT TGCTACGTCC TCCTTCTCGA CGATGCCACA
AACGATCGCG GCCTCGCGAT GACCAGCTCG TTCACCAATA TGGTTATCGC CGGACATTGC
CTGGCAAACA TTGACGATCT CAGTGCTTAT CGTCCTCTGG TTGAAAGTCT CATCGGCATG
GCGCGCGAGA TGCTGCACGT AGCTCCGCCC ATTGCGCTCG AGATCAGCAA GCTTCGCCTT
CGCAAGGCCT GCTACGTGGG CTCAGGCGCC CAGGCAGCCA CCGCCACCGA GTGCGCGCTG
AAGACCGTCG AAATGAGCGC AGGTACTATT CACACCATGG CAGAATCCAC GATGGGTCTG
CGTCACGGAC CGATGTCTGC CCTGGGTGAC GATTCACTCT TTGTCGCCTT CGTCTCCAGC
GATGAACGTC GCCAGCGCTA CGAACTCGAT CTCATTAAGG AGATTCATCG CAAGCAGCTC
GGACGAGTGC GCGTCGCCAT AGCGCCTCCG AACCTGGATG AACTTGTGGA CTACTGCGAA
CACGTCATCA CGCTCGATGC GCCTCCTGAT TTTCCGGACG ACTATCGCGT CCCAGTGGAT
GTCATCTTCG GGCAACTCGT CGGATTGTTC TCCTCCATCC AGGCAGGACT TCAGCCAGAT
CGGCCGAGCC CAAATGGGGC AATCACGCGC GTTGTATCGG ATGTGAACAT ATATCTCGAT
TAG
 
Protein sequence
MTIKTEKNSG NPLSALMELA QEEKASRGLL HTPAEIAQQP ETWQGTYARV DRQKSALREF 
LGVSLSATTT VYLIGAGTSD YIGRALCSVL RQKWQCDVIA VPSTELITNL ENYVLPRRNY
LWISFSRSGD SSEGVGVLDL AITKYPAIRH LIVGCNKDGK MAQMCADRDN CYVLLLDDAT
NDRGLAMTSS FTNMVIAGHC LANIDDLSAY RPLVESLIGM AREMLHVAPP IALEISKLRL
RKACYVGSGA QAATATECAL KTVEMSAGTI HTMAESTMGL RHGPMSALGD DSLFVAFVSS
DERRQRYELD LIKEIHRKQL GRVRVAIAPP NLDELVDYCE HVITLDAPPD FPDDYRVPVD
VIFGQLVGLF SSIQAGLQPD RPSPNGAITR VVSDVNIYLD