Gene Acid345_0430 details


Gene Information

Locus tag: Acid345_0430
Symbol:
ID: 4069656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 506382
End bp: 508517
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 57%
IMG OID: 637982434
Product: galactose-binding superfamily protein
Protein accession: YP_589509
Protein GI: 94967461
COG category:
COG ID:
TIGRFAM ID:
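
The coordinates and lengths listed above are mutually consistent, which a short check can make explicit. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(506382, 508517)
print(length)                  # 2136
print(protein_length(length))  # 711
```

Both values agree with the Gene Length and Protein Length fields of the record.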


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0130006
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCGC GATTTGCTTC CATCTCGTGC GCAATGTTTC TTGCCGCTGC AAGTTTGGGT 
GCTCAAACGA TCGTCGTCGA CGCCGATCCC GCACACGTCG TGAATCGCTT CCGGCCGCAA
TACGCGCTGG GAACCACGGT AGATCGCGTC CCGAGCAACG CAACCGATAC TTTCTTTCGC
CCCGATCAAG TGCAACAAGT GCTGAGCGCG GGATGGGGCG TCGTCAGTTA TCGCCAGAAC
ACTGAGCTCT TCATTCAAGC GTGGCATTGG AACCCTAAGG GCAAGTGGAG CGATCCGAGC
GGCAAGGGCT ATTTCGTCGG CGACGCCACG CCTACGAGTG AGCCGATTCG CCACTCGTTC
GGATACTCGC TCCAGCATCG TGGCTTTACC CGCAACGGGG GCTCCGAGTT CGACGGCTTC
TCTCGTCTTG ATGATGGCGA CATCAAAACT TATTGGAAGA GCAATCCCTA CCTTACACAG
ACGTTCACTG GCGAACCGGA TTCCGCGCAT GCGCAATGGA TCGTGATTGA TTTGGAAAAG
CCGCAGGACA TCAACGCGAT CCGCATCGCA TGGGCCGAAC CGTATGCGCG CGAATACAAC
GTGCAGTACT GGGAAGGTTC GGGCGACGCG ATGGATGAAC AGGATAAAGG CACGTGGAAG
AACTTTTCAT CTGGGGCTGT CGCGAACGGT GAGGGTGGCA CTCCGACGAT CAAGCTCAGC
ACCTCAACGG TCCAAGCGCG ATACGTGCGT GTGCTGATGA CCGGTTCCTC CAATACCTGC
GACACCCACG GATCCAGTGA CAAGAGAAAT TGTGTTGGTT ACGCCATTCG CGAGGTCTAT
CTGGGTACAG TCGATCAGAG CGGCGATTTC CACGACCTGC TCAAACACAC TCCGGGACAG
CAGCAAACAT TGACGTATTG CTCCTCCGTT GATCCCTGGC ATGAACCGTC CGACCTGTAC
GTCGCACCAG ACCGGATGGA ATCGGGAGAT CAGCCCGGCT TCGACTTGTT CTTCACCAGC
GGCATCACCC GCGGACTTCC TGCAATCGTA CCGATTGCGT TGATCTATGG AACGCCAGAA
GATTCCGCGG CCCAAATGGC TTACCTCAAA GCTCGCAAAT ATCCGATCTC ATACATCGAG
ATGGGTGAAG AGGCCGATGG TCAGTACATG CAGCCCGAGG ACAACGCGGC TCTGTACATT
CAGTGGGCGA CAGCCATTCA CAAGGTCGAC CCCACATTCA AGCTGGGCGG CCCGTCGTTC
CAGGGCGTGA CTGAAGACAT CAAGGCGTGG GCTGACCCGA AGGGCAGAAC CTCGTGGTTC
GCGCGTTTCC TTGACTACTT AAAAACCCAC GGCCATCTCG ACGATTTCGC ATTCATGTCG
TTCGAACACT ATCCCTATGA CGGATGCGAA ACGCCGTGGG AGAACCTTTA TCAGGAACCT
GAGCTAATCA TGCACGTTAT GGATGTCTGG CGAGCGGATG GATTGCCGCC AAATATTCCG
CTGCTCGACA CCGAAACCAA CGATCACGGT GGGGAAGCGG CGGTGGACAT CTTCGGCGCG
CTTTGGCTCG GCGATTCTTT CGCCGGTTTC CTTACCGCCG GCGGACAATC CACCCACTAC
TATCACGCAC TTTCGTACTC CCCGCCGCAT CCGGCGTGTC CGAACAGCTG GGGCACTTAT
CACATGTTCA TGGTCGATAA AAACTATGTG ATTCAACGGA AGACCTCGCA ATATTTTGGC
GCGAAAATGC TGACCCAGGA GTGGGTGCAG CCCGGCGATG CTGAACATCA ACTCTTCCGC
GCCACGAGCG ACATCAAAGA CGCCGCAGGA CACACGCTGG TGACTGCCTA CCCGCTACTT
CGCCCGGACG GCAAATGGTC GATTTTGCTG ATCAACAAAG ATCGCGATCA TCCACACGAA
GTGCAGATCA CATTCCGAAA CAGCGGAGCG GGAGGCCTTC GCGGCGCCGT AGAGATGGTC
ACGTTTGGAA AAGCCCAATA CCAGTGGCAT CCTGACCGAA AGAATGGTTA CGCCGACCCG
GACGGGCCGC CCGTCACCAG CAAGTTGACG GCGGGCGAGC AAACTCGCTA CACGCTGCCG
CCCGCTTCGA TCAATGTGCT TCGCGAGGTG CACTGA
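
The gene sequence above is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the sequence text; the helper names `clean_sequence` and `gc_percent` are hypothetical and not part of the source database.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as example input.
first_line = "ATGATCTCGC GATTTGCTTC CATCTCGTGC GCAATGTTTC TTGCCGCTGC AAGTTTGGGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full 2136 bp sequence, the result can be compared with the GC content listed under Gene Information.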
 
Protein sequence
MISRFASISC AMFLAAASLG AQTIVVDADP AHVVNRFRPQ YALGTTVDRV PSNATDTFFR 
PDQVQQVLSA GWGVVSYRQN TELFIQAWHW NPKGKWSDPS GKGYFVGDAT PTSEPIRHSF
GYSLQHRGFT RNGGSEFDGF SRLDDGDIKT YWKSNPYLTQ TFTGEPDSAH AQWIVIDLEK
PQDINAIRIA WAEPYAREYN VQYWEGSGDA MDEQDKGTWK NFSSGAVANG EGGTPTIKLS
TSTVQARYVR VLMTGSSNTC DTHGSSDKRN CVGYAIREVY LGTVDQSGDF HDLLKHTPGQ
QQTLTYCSSV DPWHEPSDLY VAPDRMESGD QPGFDLFFTS GITRGLPAIV PIALIYGTPE
DSAAQMAYLK ARKYPISYIE MGEEADGQYM QPEDNAALYI QWATAIHKVD PTFKLGGPSF
QGVTEDIKAW ADPKGRTSWF ARFLDYLKTH GHLDDFAFMS FEHYPYDGCE TPWENLYQEP
ELIMHVMDVW RADGLPPNIP LLDTETNDHG GEAAVDIFGA LWLGDSFAGF LTAGGQSTHY
YHALSYSPPH PACPNSWGTY HMFMVDKNYV IQRKTSQYFG AKMLTQEWVQ PGDAEHQLFR
ATSDIKDAAG HTLVTAYPLL RPDGKWSILL INKDRDHPHE VQITFRNSGA GGLRGAVEMV
TFGKAQYQWH PDRKNGYADP DGPPVTSKLT AGEQTRYTLP PASINVLREV H
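
The protein sequence can be derived from the gene sequence by codon-wise translation. The sketch below assumes that translation table 11 assigns codons to amino acids exactly as the standard genetic code does (the two differ in which start codons are permitted), and it does not handle alternative start codons. `CODON_TABLE` and `translate` are illustrative names.

```python
# Codon table built in TCAG order; amino acid string is the standard code,
# which table 11 shares for codon-to-residue assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # remove block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGATCTCGC GATTTGCTTC CATCTCGTGC"))  # MISRFASISC
# Last 6 bases: a final histidine followed by the TGA stop codon.
print(translate("CACTGA"))  # H
```

Both outputs match the start and end of the protein sequence listed above.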