Gene Acid345_0888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0888 
Symbol 
ID4069138 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1105492 
End bp1108314 
Gene Length2823 bp 
Protein Length940 aa 
Translation table11 
GC content56% 
IMG OID637982895 
Productpolysaccharide export protein 
Protein accessionYP_589965 
Protein GI94967917 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.434689 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCGCT TCGTTTTTAA CACGATGCTT TGGAGCGGAT TCACTTGGGT CCTGTTGGCG 
GCGATGGGGG TTGCTCAGCA ACTGTCGACG CCGCTGCCTG CCCAAAGCGA CATAACAGAT
GCGCGTTCTT CCGAGAGCAT GTCACTTCCA GCGTCCGCAC TCATCAATCT TCTCAAACAA
CGTCCGGACT TGGTGATCGA GATTAAACGC GCCGCAGCGA CATACCTTCA AGCGAAAGGA
ATGGACGTTT CGGAAGATGT AATCACCGAC GACATGCTGT TCGAACGCAT CAATACTGAT
CCGGATTTCC GCAAGTCGCT GACGTCGTGG TTGTGGACGC GCGGATACAT CAATCAATCG
GATATCGAGA ATGCGGCGTT GAGCCAATCG AATTCTGGTG CCGAAGAAAG CGGGAGCACG
CAACCCTTCG ACTCACAGCT ACCTACGACG TCGAATAGAG CGCAGAAAGT TCGTCCCTCG
GGCCAAGAAC CCGATCGAGA GCGGTATTCG AATTCCGCAA GTGCTGGCGT ACAAGCGACG
GGACCTGCGC AACCTCGTTC GCGCGTATCG GAGGAAGCTG GCGATCCGAA CCAGCCTACT
CAAGACGGCC TAGTACACCA ACCCACTCCA CTGAACCTTC TTGCGCTTCG AGATCTATAC
ACGCAAGTTC CGGAACCAAG CTCGTCTCTT CGCCGGTTTG GTTCGGACAC ATTTCTGCAA
CACGGACAGA GCGCAGAAGC CTCGATCGAT CTCCCAGCTG GACCGGAATA CGTTCTTGGC
CCCGGGGATG TTCTGACTTT AAGCATGTGG GGAAGCATTT CCCAGACATT GCCGCGAACT
GTGGACAGGG AAGGCCGGAT TGTGCTGCCC GAAGCTGGTC CTGTCTCGGT CGCGGGACTC
ACGCTCGAGC AAGCGCAAGC TCTGACAGAG AAAATGCTGC GACCGCAGTT TCGAGATGTG
CGGGTACAAC TATCCCTTGC GCGAGTGCGC ACTGTGCGAA TCTACGTAGT CGGTGATGTG
CAACGTCCCG GTGCATACGA TGTGAGCGCC TTGTCGACGG TGGTGAATGC ACTGTTTGCT
GCGGGTGGGC CTACTGCTAT CGGATCCTTG CGGACAGTGC GTCACATGCG AAACAAGGAG
CTGGTGCGCG AAGTTGATCT GTACGATTTT CTCTTGCGAG GAATTCACGC GGACGTAGAA
CGCCTCGAGC CTGGTGACAC GGTTCTCGTC CCTCCGGCGG GCCGCCAGGT GACTGTCTCC
GGGATGGTCC GGCGCCCTGC AATCTACGAG CTTCGAGGCG AGAGATCGAT CGACGACGTC
GTCGCTCTCG CCGGCGGACT CCTCGTCTCG GCGTCAACCT CACAGATACG GATCGAACGT
GTCAGGGCGA ACACAGCTCG CGTAACGGAC GAGATCACAG TTAAAAACTC GGACGATGCG
TCTTCGGTTC GGGCATCGCT GCAAGCGTAT GCGGTCGAAG ACGGTGACAG AGTCGTGATT
GCTCCGATCC TTCCGTATAG CGAACGTGCC ATTTACGTGG AAGGGCACGT AATTCGCCCC
GGCAAGATTG CATATCGCGA CAACATGTCT GTTACCGATG TGATCAGATC TTATAGAGAC
CTGCTGCCTG AACCCGCTGA GCGGGCAGAA ATCATCCGGT TGCGTCCTCC CGACTTTCGG
CCGGAAACCA TCGAATTCAA CTTGTCAGAG GCGTTGATCG GCAATAGCCA GATTCACTTG
CAACCGTTCG ACACGATTCG TGTCCTGGGC AGATACGAGT TCGATGCTCC GAAGGTCACG
ATTCAAGGCG AAGTCTTACG GCCAGGCACC TATCCCTTAC CCGAGAAACT TACCGTCGCT
CAGCTAGTGC GCTTGGCAGG AGGTTTTAAG CGTTCAGCAT TAAAAGACCA CGCTGATCTC
ACCAGCTATG ACGTGCAGCA AGGCGCCAGG GTCACGAGTC ATCGCCTGTC GATTGATATT
GGGCGGGCGG TTGATGATGC CGACTCCGCG GCTGACGTCG CACTGAAAAC CGGCGATGTG
TTAACAATTC ACCAGATCGC TGGCTGGAAC GACATCGGCG CCTCTGTGAC TCTAGGAGGC
GAGGTTGCGT ACCCCGGAAC GTATGGCATT CAGGAAGGGG AGCGACTCAG CTCGGTGCTG
AAACGTGCGG GTGGTTTCCG CGACACTGCC TATCCGACCG GTGCGTATTT GTCTCGTGTC
CAGGTGCGAG ATTTCGAAGA GAAGAGCCGC AACGAACTCA TCCGACAAAT TGAAACTACC
TCTGCAGCAA CAAAGATTTC GCCTTCTCTG AGTACCACCG AGCAGGCGGC GACGCTGCAA
TTGATCACGC AGCAGCAGGA ACAAGTACTG CAGCGGCTGC GGAATCAGCC GTCGACGGGA
CGGCTTGTGA TCAAGATCAA TAGTGACATT GCCACGTGGG AAGGTACTCC CATCGATATC
GAACTTAGAT CTGGCGACGT GCTGACGATT CCCAAGCGAC CTGGATTCGT GCTCGTCACT
GGGCAGGTTT ACAACTCGAC CGCGATCACG TATGTGCCGG GGAGGGATGC GAATTGGTAT
TTGCATCGTG CAGGTGGACC GACCGCAATG GCGAGCAAGA AAGAAATTTT CGTCATTCGG
GCGAACGGCT CTGTTGTAGG TCGCGAGTCC GATGAGAGCG CGCTTCACGC CAAGTTGGAC
GCGGGCGACG TGGTAGTTGT GCCTCAAAAG ATCATAGGCG GCTCGATGTT CTGGCGAAAT
CTCCTGGCAA CCGCGCAATT CGTGGCTTCT TTTGCGATCA CTGCTAAGGT CGCTGGACTT
TAA
 
Protein sequence
MRRFVFNTML WSGFTWVLLA AMGVAQQLST PLPAQSDITD ARSSESMSLP ASALINLLKQ 
RPDLVIEIKR AAATYLQAKG MDVSEDVITD DMLFERINTD PDFRKSLTSW LWTRGYINQS
DIENAALSQS NSGAEESGST QPFDSQLPTT SNRAQKVRPS GQEPDRERYS NSASAGVQAT
GPAQPRSRVS EEAGDPNQPT QDGLVHQPTP LNLLALRDLY TQVPEPSSSL RRFGSDTFLQ
HGQSAEASID LPAGPEYVLG PGDVLTLSMW GSISQTLPRT VDREGRIVLP EAGPVSVAGL
TLEQAQALTE KMLRPQFRDV RVQLSLARVR TVRIYVVGDV QRPGAYDVSA LSTVVNALFA
AGGPTAIGSL RTVRHMRNKE LVREVDLYDF LLRGIHADVE RLEPGDTVLV PPAGRQVTVS
GMVRRPAIYE LRGERSIDDV VALAGGLLVS ASTSQIRIER VRANTARVTD EITVKNSDDA
SSVRASLQAY AVEDGDRVVI APILPYSERA IYVEGHVIRP GKIAYRDNMS VTDVIRSYRD
LLPEPAERAE IIRLRPPDFR PETIEFNLSE ALIGNSQIHL QPFDTIRVLG RYEFDAPKVT
IQGEVLRPGT YPLPEKLTVA QLVRLAGGFK RSALKDHADL TSYDVQQGAR VTSHRLSIDI
GRAVDDADSA ADVALKTGDV LTIHQIAGWN DIGASVTLGG EVAYPGTYGI QEGERLSSVL
KRAGGFRDTA YPTGAYLSRV QVRDFEEKSR NELIRQIETT SAATKISPSL STTEQAATLQ
LITQQQEQVL QRLRNQPSTG RLVIKINSDI ATWEGTPIDI ELRSGDVLTI PKRPGFVLVT
GQVYNSTAIT YVPGRDANWY LHRAGGPTAM ASKKEIFVIR ANGSVVGRES DESALHAKLD
AGDVVVVPQK IIGGSMFWRN LLATAQFVAS FAITAKVAGL