Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0888 |
Symbol | |
ID | 4069138 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1105492 |
End bp | 1108314 |
Gene Length | 2823 bp |
Protein Length | 940 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982895 |
Product | polysaccharide export protein |
Protein accession | YP_589965 |
Protein GI | 94967917 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.434689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCGCT TCGTTTTTAA CACGATGCTT TGGAGCGGAT TCACTTGGGT CCTGTTGGCG GCGATGGGGG TTGCTCAGCA ACTGTCGACG CCGCTGCCTG CCCAAAGCGA CATAACAGAT GCGCGTTCTT CCGAGAGCAT GTCACTTCCA GCGTCCGCAC TCATCAATCT TCTCAAACAA CGTCCGGACT TGGTGATCGA GATTAAACGC GCCGCAGCGA CATACCTTCA AGCGAAAGGA ATGGACGTTT CGGAAGATGT AATCACCGAC GACATGCTGT TCGAACGCAT CAATACTGAT CCGGATTTCC GCAAGTCGCT GACGTCGTGG TTGTGGACGC GCGGATACAT CAATCAATCG GATATCGAGA ATGCGGCGTT GAGCCAATCG AATTCTGGTG CCGAAGAAAG CGGGAGCACG CAACCCTTCG ACTCACAGCT ACCTACGACG TCGAATAGAG CGCAGAAAGT TCGTCCCTCG GGCCAAGAAC CCGATCGAGA GCGGTATTCG AATTCCGCAA GTGCTGGCGT ACAAGCGACG GGACCTGCGC AACCTCGTTC GCGCGTATCG GAGGAAGCTG GCGATCCGAA CCAGCCTACT CAAGACGGCC TAGTACACCA ACCCACTCCA CTGAACCTTC TTGCGCTTCG AGATCTATAC ACGCAAGTTC CGGAACCAAG CTCGTCTCTT CGCCGGTTTG GTTCGGACAC ATTTCTGCAA CACGGACAGA GCGCAGAAGC CTCGATCGAT CTCCCAGCTG GACCGGAATA CGTTCTTGGC CCCGGGGATG TTCTGACTTT AAGCATGTGG GGAAGCATTT CCCAGACATT GCCGCGAACT GTGGACAGGG AAGGCCGGAT TGTGCTGCCC GAAGCTGGTC CTGTCTCGGT CGCGGGACTC ACGCTCGAGC AAGCGCAAGC TCTGACAGAG AAAATGCTGC GACCGCAGTT TCGAGATGTG CGGGTACAAC TATCCCTTGC GCGAGTGCGC ACTGTGCGAA TCTACGTAGT CGGTGATGTG CAACGTCCCG GTGCATACGA TGTGAGCGCC TTGTCGACGG TGGTGAATGC ACTGTTTGCT GCGGGTGGGC CTACTGCTAT CGGATCCTTG CGGACAGTGC GTCACATGCG AAACAAGGAG CTGGTGCGCG AAGTTGATCT GTACGATTTT CTCTTGCGAG GAATTCACGC GGACGTAGAA CGCCTCGAGC CTGGTGACAC GGTTCTCGTC CCTCCGGCGG GCCGCCAGGT GACTGTCTCC GGGATGGTCC GGCGCCCTGC AATCTACGAG CTTCGAGGCG AGAGATCGAT CGACGACGTC GTCGCTCTCG CCGGCGGACT CCTCGTCTCG GCGTCAACCT CACAGATACG GATCGAACGT GTCAGGGCGA ACACAGCTCG CGTAACGGAC GAGATCACAG TTAAAAACTC GGACGATGCG TCTTCGGTTC GGGCATCGCT GCAAGCGTAT GCGGTCGAAG ACGGTGACAG AGTCGTGATT GCTCCGATCC TTCCGTATAG CGAACGTGCC ATTTACGTGG AAGGGCACGT AATTCGCCCC GGCAAGATTG CATATCGCGA CAACATGTCT GTTACCGATG TGATCAGATC TTATAGAGAC CTGCTGCCTG AACCCGCTGA GCGGGCAGAA ATCATCCGGT TGCGTCCTCC CGACTTTCGG CCGGAAACCA TCGAATTCAA CTTGTCAGAG GCGTTGATCG GCAATAGCCA GATTCACTTG CAACCGTTCG ACACGATTCG TGTCCTGGGC AGATACGAGT TCGATGCTCC GAAGGTCACG ATTCAAGGCG AAGTCTTACG GCCAGGCACC TATCCCTTAC CCGAGAAACT TACCGTCGCT CAGCTAGTGC GCTTGGCAGG AGGTTTTAAG CGTTCAGCAT TAAAAGACCA CGCTGATCTC ACCAGCTATG ACGTGCAGCA AGGCGCCAGG GTCACGAGTC ATCGCCTGTC GATTGATATT GGGCGGGCGG TTGATGATGC CGACTCCGCG GCTGACGTCG CACTGAAAAC CGGCGATGTG TTAACAATTC ACCAGATCGC TGGCTGGAAC GACATCGGCG CCTCTGTGAC TCTAGGAGGC GAGGTTGCGT ACCCCGGAAC GTATGGCATT CAGGAAGGGG AGCGACTCAG CTCGGTGCTG AAACGTGCGG GTGGTTTCCG CGACACTGCC TATCCGACCG GTGCGTATTT GTCTCGTGTC CAGGTGCGAG ATTTCGAAGA GAAGAGCCGC AACGAACTCA TCCGACAAAT TGAAACTACC TCTGCAGCAA CAAAGATTTC GCCTTCTCTG AGTACCACCG AGCAGGCGGC GACGCTGCAA TTGATCACGC AGCAGCAGGA ACAAGTACTG CAGCGGCTGC GGAATCAGCC GTCGACGGGA CGGCTTGTGA TCAAGATCAA TAGTGACATT GCCACGTGGG AAGGTACTCC CATCGATATC GAACTTAGAT CTGGCGACGT GCTGACGATT CCCAAGCGAC CTGGATTCGT GCTCGTCACT GGGCAGGTTT ACAACTCGAC CGCGATCACG TATGTGCCGG GGAGGGATGC GAATTGGTAT TTGCATCGTG CAGGTGGACC GACCGCAATG GCGAGCAAGA AAGAAATTTT CGTCATTCGG GCGAACGGCT CTGTTGTAGG TCGCGAGTCC GATGAGAGCG CGCTTCACGC CAAGTTGGAC GCGGGCGACG TGGTAGTTGT GCCTCAAAAG ATCATAGGCG GCTCGATGTT CTGGCGAAAT CTCCTGGCAA CCGCGCAATT CGTGGCTTCT TTTGCGATCA CTGCTAAGGT CGCTGGACTT TAA
|
Protein sequence | MRRFVFNTML WSGFTWVLLA AMGVAQQLST PLPAQSDITD ARSSESMSLP ASALINLLKQ RPDLVIEIKR AAATYLQAKG MDVSEDVITD DMLFERINTD PDFRKSLTSW LWTRGYINQS DIENAALSQS NSGAEESGST QPFDSQLPTT SNRAQKVRPS GQEPDRERYS NSASAGVQAT GPAQPRSRVS EEAGDPNQPT QDGLVHQPTP LNLLALRDLY TQVPEPSSSL RRFGSDTFLQ HGQSAEASID LPAGPEYVLG PGDVLTLSMW GSISQTLPRT VDREGRIVLP EAGPVSVAGL TLEQAQALTE KMLRPQFRDV RVQLSLARVR TVRIYVVGDV QRPGAYDVSA LSTVVNALFA AGGPTAIGSL RTVRHMRNKE LVREVDLYDF LLRGIHADVE RLEPGDTVLV PPAGRQVTVS GMVRRPAIYE LRGERSIDDV VALAGGLLVS ASTSQIRIER VRANTARVTD EITVKNSDDA SSVRASLQAY AVEDGDRVVI APILPYSERA IYVEGHVIRP GKIAYRDNMS VTDVIRSYRD LLPEPAERAE IIRLRPPDFR PETIEFNLSE ALIGNSQIHL QPFDTIRVLG RYEFDAPKVT IQGEVLRPGT YPLPEKLTVA QLVRLAGGFK RSALKDHADL TSYDVQQGAR VTSHRLSIDI GRAVDDADSA ADVALKTGDV LTIHQIAGWN DIGASVTLGG EVAYPGTYGI QEGERLSSVL KRAGGFRDTA YPTGAYLSRV QVRDFEEKSR NELIRQIETT SAATKISPSL STTEQAATLQ LITQQQEQVL QRLRNQPSTG RLVIKINSDI ATWEGTPIDI ELRSGDVLTI PKRPGFVLVT GQVYNSTAIT YVPGRDANWY LHRAGGPTAM ASKKEIFVIR ANGSVVGRES DESALHAKLD AGDVVVVPQK IIGGSMFWRN LLATAQFVAS FAITAKVAGL
|
| |