Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_2236 |
Symbol | |
ID | 5162638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 2594078 |
End bp | 2596924 |
Gene Length | 2847 bp |
Protein Length | 948 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640549730 |
Product | polysaccharide export protein |
Protein accession | YP_001230992 |
Protein GI | 148264286 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.987184 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCTTGAGTAA CAGTATGCTG ACCTTGCTTC TGCTCCTTGT TTTCGTGCCT GCATTTGCGT TGACGGCAGA TGAAATGAAA AAAATCGAAG AACAGCAGGG TATGTACACA GGGGCGGAAA AGCAGGGAAT CCCTCTTGGA GAACTGAATG AATTGAAGGG GAGAAGTACA TCTGATATCA CTAAGGGGGT CCCTTCCGAT AAGGACAAAT CTTTAACGGA GCCGCCCGGT AATAAAGCCA GGATTTTTAT GAAAACCGAG CCGGGCGACG GGCTCATTTC CCTTAGTTGG GACATCAAGG GACTTCAACA GAAACCTGGC GATCCGCCGT TGAAATATAC CCTGTTTTAC GGAACCGAGT CCGGGAGTTA TGACAAAAAA CTCGATGTTG GTAATGTACG GGATTATAAA CTCCGGGGAT TGAAAAACCA CCAGGTCTAT TATATCAAGA TCCAGGGAAG CACCAGCATC CAGGTCAAAC AGGAAACGGA CGAGGCAGAA CCGAAAGCTG TCGACCTGGT TCTTTTCTCC AGAGAAATGA CTGCTATTCC TTTGCCGACG GAAGAGCAGG GTTCTCAGCT GGAAAAATCA TTTGCCAGGA ACGTAACAAC CTTGCAGGAC AACCTGGAGG CGGACCCGTT CAAGAGGGAC CTCAAGCAGT TCGGCTATGA TTTTTTCAAA AACAGTCTTT CCACCGGAAT ATTGACCGAC AATCTTCCGG TTGGGGGAGA TTATATTATT GGCCCTGGCG ATTCTCTGCG CATCGACCTG TGGGGGTCAG TGCAGGCACG ATATGATGCA ACGGTGGACA GAAACGGTGA AATTTCTCTG CCAAAAGTCG GCACGGTAAA AGTCTGGGGG ATCAGTTATG CCCAGGCTAA AGATGTCATC AACAAGGCTA TTTCCCGCTA CTTTAAGGGG TACGAACTGA ACGTCACGCT TGGCAGATTA CGCACCATTC AGGTGTTTGT AGTCGGCGAG GTGGAATCTC CCGGCACGTA TTCGGTCAGC TCGCTTGCGA CAGTCATAAA CGCTTTGTCT CAGGCGGGCG GCCCCTCATT GAACGGAAGC CTGCGCACCA TCAGACTGCT CAGGGGGGGG AAGGTCGTTC AGGAAATCGA CCTTTACGAT ATGCTCCTCG GTGGCGACCG GAGCAAAGAC CTGCGTCTGG AAAACGGCGA CACCATCTTT GTTCCTGTCA TAGGTCCTGT TGCGGCAGTG GCCGGTGAGG TGAAGCGACC GGGGATCTAC GAATTAAAGG GGACGACGAA CCTCGTTCAG ATTCTGCAGC TCGCGGGCGG CATTGCCGCG TCCGGGGATA CCGGCAGGCT GCAGGTAGAA AGGATTGAAG GGAATAGCGC GCGTATTGTC CTCGATTATG AGCCGAAGGC CGGCCAGATG GAGGAAACTC TTGCAAAGGT GACGATCCAC GACAGGGACA TGATAAAGGT CTTTCCCGTG TTCAAGGCAT TACGTCATGT GGTCGGCCTC AAGGGCAATG TGGCGCGGCC GGGCGAATAC CAGTACAAAG ACGGGATGCG AGTAACCGAT ATAATCCCTT CTTACACGGC GCTCTTGCCG GATTCCTATC TGGAGTCCGC GGAAATCTCC CGCCTGGTTT CCCCGGATTT CCACAAGGAA ACATTGTCGA TCAATCTGCG AAAAGCCATG GAAGGCGACC AGAAGGAAAA TATCCTGCTT CAGGAGCAGG ATACCATCAG GGTGTTTTCC CGCGGAGAGA TGGAAGAAAA GCCGGTGGTT TCAATCAATG GACAGGTGCT GAATCCAGGC ACCTATGATT ACTACCAGCG AATGACGGTG CGTGATCTGG TGACCGCTGC CGGCAGCCTC AAGAGAAACG CATATCTCGA TAACGCCGAG TTGACACGTA TTGATGTGGT GCATGGCAAA GCCAATTCAA TACGCGTGGA TATCAACCTG AAAAAAGCTA TGTCAGGAGA TCCGGAACAG AATATTCAGC TTCAGCCCGA TGATGTGCTT ATTGTGCGCG GCGTTGTCGA GTGGCTTGAT GCAACCGATA GATTCGTTAC CCTTAAGGGC GAAGTCAGGT TTCCCGGCAT TTATTCCATT GCCAAGGGCG AGAAGCTTGA TTCCGTGATT TCCAGGGCCG GTGGTTTTAC CGACAAGGCA TATCTGAAAG GGGCCAAGTT CACAAGGAAA TCGGTTCAGG AAAGCCAGCA AAAGCGGATG GATGAAGTTA TTTCCCGCAG CGAACAGGAC ATCTTGAAGA AGCAGGGTGA ACTCGCTTCG CTGGCTTCTT CAAAGGAAGA GCTTGAAGCA ACCAAGGCGT CGTTGGAAGG GTTGATGAAA GGGTTGGAGA AATTGAAGTT GGTAAAAGCC GAAGGTCGGG TGGTAGTGCG TCTTGTACCT TTAAACGAGC TGCAAAAGAG CCCCTATGAC CTTGAAATGA TGGGGGGCGA TATCCTCGAC ATACCACAAA CGCCCAATGT TATCAATGTT ATGGGGCAGG TATACAATCC GACGACTTTT GTCCATATGG CCGGCGGCAG TATTGCATCC TATTTGAAAA ATGCCGGCGG ACCGACCAGG GATGCAGAGG AAGACGAAAT GTACATCATC AAGGCGGATG GTTCCGTAGA CAGTCGCCAA CAGGCCACGT TCGGCATCCA TTGGGACGAA GTGTCGAAGA GCTGGACCTT CGGCAGCTTC ATGTCCAAGA CCATGGATCC GGGTGACACC CTGGTGGTGC CGCAGAAACT GGAGCGCACG GCATGGCTGA GGGAAATCAA GGATATAACC ACCATCCTGT CGCAGGTGGC GCTGACCGCA GCCACCGTCT TCATCGGACT CAAATAA
|
Protein sequence | MKKILSNSML TLLLLLVFVP AFALTADEMK KIEEQQGMYT GAEKQGIPLG ELNELKGRST SDITKGVPSD KDKSLTEPPG NKARIFMKTE PGDGLISLSW DIKGLQQKPG DPPLKYTLFY GTESGSYDKK LDVGNVRDYK LRGLKNHQVY YIKIQGSTSI QVKQETDEAE PKAVDLVLFS REMTAIPLPT EEQGSQLEKS FARNVTTLQD NLEADPFKRD LKQFGYDFFK NSLSTGILTD NLPVGGDYII GPGDSLRIDL WGSVQARYDA TVDRNGEISL PKVGTVKVWG ISYAQAKDVI NKAISRYFKG YELNVTLGRL RTIQVFVVGE VESPGTYSVS SLATVINALS QAGGPSLNGS LRTIRLLRGG KVVQEIDLYD MLLGGDRSKD LRLENGDTIF VPVIGPVAAV AGEVKRPGIY ELKGTTNLVQ ILQLAGGIAA SGDTGRLQVE RIEGNSARIV LDYEPKAGQM EETLAKVTIH DRDMIKVFPV FKALRHVVGL KGNVARPGEY QYKDGMRVTD IIPSYTALLP DSYLESAEIS RLVSPDFHKE TLSINLRKAM EGDQKENILL QEQDTIRVFS RGEMEEKPVV SINGQVLNPG TYDYYQRMTV RDLVTAAGSL KRNAYLDNAE LTRIDVVHGK ANSIRVDINL KKAMSGDPEQ NIQLQPDDVL IVRGVVEWLD ATDRFVTLKG EVRFPGIYSI AKGEKLDSVI SRAGGFTDKA YLKGAKFTRK SVQESQQKRM DEVISRSEQD ILKKQGELAS LASSKEELEA TKASLEGLMK GLEKLKLVKA EGRVVVRLVP LNELQKSPYD LEMMGGDILD IPQTPNVINV MGQVYNPTTF VHMAGGSIAS YLKNAGGPTR DAEEDEMYII KADGSVDSRQ QATFGIHWDE VSKSWTFGSF MSKTMDPGDT LVVPQKLERT AWLREIKDIT TILSQVALTA ATVFIGLK
|
| |