Gene Acid345_0426 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0426 
Symbol 
ID4069652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp497199 
End bp499040 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content55% 
IMG OID637982430 
Productglycoside hydrolase family protein 
Protein accessionYP_589505 
Protein GI94967457 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0129037 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCACG CTGCGAAGAT GTCATCTGTC CTCCTCCTTC TCTGTGCGTC TGCGTTTTGC 
GCGAATGATC TCAAGGTTCT AACCGATCAC GTTGGTTATG AGACCACAGG TGCGAAACAT
GCTGTCGTTC TCGGTAAAGC CGGTGACCGT GTTTCCGAAT GCTCGATCAA GAATTCAACC
GACGACAGAG TTGTTGCGCC GATAAAGGCG GTGGCGGTCG GCCCAGTGAA GAAGTGGCGC
GATTGGTATT TTTGGACATT GGACTTCGAC AGCCTCACGC AGGAAGGCCA CTACTACATT
GAGTGCGCCT CATCGCGAGG CGCAGTGCGA TCGTTTCCCT TTGCCGTTCA AGCCAATCTC
CTGGAACAAG GCACTCTCTC TGATGTCCTG TACTACTTCA AAGACGAACG CAGTTCGGGA
CCAATGGACA AGGCGGACAG TCACCTGCCC TTCGATCCTC CGAAGCAAGG CACTCTCGAT
GCGCATGGTG GCTGGTGGGA CGCGACAGGC GATTACGGGA AGCATCTGTC GCACCTCTCA
TTCTCAACCT ACTTCAATCC GCAGCAGATC CCGCTCGTCG TGTACTCGCT GCTAAAGAGC
TACGGGCAAC TCACTCGGCG GGGACTCCCG GAGGTTACAC GCTACAAAGA CCGCATTCTC
GACGAAGCGA TGTTTGGTGC TGATTATCTC GTGCGCGTAA AAGATCCGAG TGGTTCTTTC
TATCGCTCAA TTTCGACGGG CGGCGTAAAG CAGGTGCCCG AAGAGCGCAA GGTCGCCGGC
GAGATGAAGA AGTTCGCGAT CTACCAGTCG AACGACAAGC GTCCTGACAT GATTGAGAAG
GCGAACAACG ATCTTGAGTA CGAAGTTAGC TATCGTTCTG GCGGCGGCAT TGCGATCGCT
GCTTTGGCGA TGGCTAGCAC TGCTCCTATC TCAGGTGAAT ACAAGAATGC GGATTACCTG
AAGGCTGCCG AAGACGCTTT CGCTTACCTT GAGAAGAACA ACCTGAAAAT GGTCAACGAT
GGCAAAGAGA ACATCGTTGA CGATTACTGT GCACTCACCG CAGCGACTGA GTTGTTCCGC
GCAACGAAGA AGCCAATATA CAAGGAAGCA GCTGATCGCC GCGCGTCAAG CCTAGTGTCG
CGCCTGGCGA GCGATGGTCA GCACCAGAAT TACTGGCGCG CCGACGACCA TGATCGTCCC
TTTTTTCATG CGTCTGATGC CGGTCTTCCT GTGGTTAGCT TGCTGTACTA CGCGGAGGTT
ACCGACGCGC AAACTCGCAC AAAGGTTCTC GAAACGGTTA AGAAGTCGCT CGCTTTCGAG
CTTGCGACTA CGCGTGAGGT CCCAAATCCT TTTGGCTATG CCCGAGAGTT TGTTCAGGAC
AAAACCGGCG CCCGTCGCAC CAGCTTCTTC TTCCCACATA ACAGCGATGC GGCACCGTGG
TGGCAGGGCG AAAATGCGCG GCTCGCATCT TTGTCTTCAG CAGCCAGACT TGCTGCGCTT
CAATTCACCG ACGATCCGGA GTTCGCGAAG CAACTCAATT CGTATGCTCT GAACCAGCTC
AATTGGATCG TTGGACTGAA TCCGTTCGAT TCCTCGATGT TGAACGGCGT CGGCCATAAC
AATCCGCAGT ACCTGTTTTT CGATTCCTGG GAATTCACCA ATGCCCCGGG CGGCATATCG
AACGGCATTA CCAGCGGCTT CCGCGACGAA GACGATATCG ACTTTAACCT TACGTACAAA
CAGACCGGGG CCGACAACGA TTGGCGTTGG CAGGAACAGT GGCTGCCACA TGCGTCGTGG
TATTTGCTTG CGGTTTCAAC GGGCAACACC TCGCCTCGCT GA
 
Protein sequence
MLHAAKMSSV LLLLCASAFC ANDLKVLTDH VGYETTGAKH AVVLGKAGDR VSECSIKNST 
DDRVVAPIKA VAVGPVKKWR DWYFWTLDFD SLTQEGHYYI ECASSRGAVR SFPFAVQANL
LEQGTLSDVL YYFKDERSSG PMDKADSHLP FDPPKQGTLD AHGGWWDATG DYGKHLSHLS
FSTYFNPQQI PLVVYSLLKS YGQLTRRGLP EVTRYKDRIL DEAMFGADYL VRVKDPSGSF
YRSISTGGVK QVPEERKVAG EMKKFAIYQS NDKRPDMIEK ANNDLEYEVS YRSGGGIAIA
ALAMASTAPI SGEYKNADYL KAAEDAFAYL EKNNLKMVND GKENIVDDYC ALTAATELFR
ATKKPIYKEA ADRRASSLVS RLASDGQHQN YWRADDHDRP FFHASDAGLP VVSLLYYAEV
TDAQTRTKVL ETVKKSLAFE LATTREVPNP FGYAREFVQD KTGARRTSFF FPHNSDAAPW
WQGENARLAS LSSAARLAAL QFTDDPEFAK QLNSYALNQL NWIVGLNPFD SSMLNGVGHN
NPQYLFFDSW EFTNAPGGIS NGITSGFRDE DDIDFNLTYK QTGADNDWRW QEQWLPHASW
YLLAVSTGNT SPR