Gene Gura_2236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGura_2236 
Symbol 
ID5162638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter uraniireducens Rf4 
KingdomBacteria 
Replicon accessionNC_009483 
Strand
Start bp2594078 
End bp2596924 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content52% 
IMG OID640549730 
Productpolysaccharide export protein 
Protein accessionYP_001230992 
Protein GI148264286 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.987184 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA TCTTGAGTAA CAGTATGCTG ACCTTGCTTC TGCTCCTTGT TTTCGTGCCT 
GCATTTGCGT TGACGGCAGA TGAAATGAAA AAAATCGAAG AACAGCAGGG TATGTACACA
GGGGCGGAAA AGCAGGGAAT CCCTCTTGGA GAACTGAATG AATTGAAGGG GAGAAGTACA
TCTGATATCA CTAAGGGGGT CCCTTCCGAT AAGGACAAAT CTTTAACGGA GCCGCCCGGT
AATAAAGCCA GGATTTTTAT GAAAACCGAG CCGGGCGACG GGCTCATTTC CCTTAGTTGG
GACATCAAGG GACTTCAACA GAAACCTGGC GATCCGCCGT TGAAATATAC CCTGTTTTAC
GGAACCGAGT CCGGGAGTTA TGACAAAAAA CTCGATGTTG GTAATGTACG GGATTATAAA
CTCCGGGGAT TGAAAAACCA CCAGGTCTAT TATATCAAGA TCCAGGGAAG CACCAGCATC
CAGGTCAAAC AGGAAACGGA CGAGGCAGAA CCGAAAGCTG TCGACCTGGT TCTTTTCTCC
AGAGAAATGA CTGCTATTCC TTTGCCGACG GAAGAGCAGG GTTCTCAGCT GGAAAAATCA
TTTGCCAGGA ACGTAACAAC CTTGCAGGAC AACCTGGAGG CGGACCCGTT CAAGAGGGAC
CTCAAGCAGT TCGGCTATGA TTTTTTCAAA AACAGTCTTT CCACCGGAAT ATTGACCGAC
AATCTTCCGG TTGGGGGAGA TTATATTATT GGCCCTGGCG ATTCTCTGCG CATCGACCTG
TGGGGGTCAG TGCAGGCACG ATATGATGCA ACGGTGGACA GAAACGGTGA AATTTCTCTG
CCAAAAGTCG GCACGGTAAA AGTCTGGGGG ATCAGTTATG CCCAGGCTAA AGATGTCATC
AACAAGGCTA TTTCCCGCTA CTTTAAGGGG TACGAACTGA ACGTCACGCT TGGCAGATTA
CGCACCATTC AGGTGTTTGT AGTCGGCGAG GTGGAATCTC CCGGCACGTA TTCGGTCAGC
TCGCTTGCGA CAGTCATAAA CGCTTTGTCT CAGGCGGGCG GCCCCTCATT GAACGGAAGC
CTGCGCACCA TCAGACTGCT CAGGGGGGGG AAGGTCGTTC AGGAAATCGA CCTTTACGAT
ATGCTCCTCG GTGGCGACCG GAGCAAAGAC CTGCGTCTGG AAAACGGCGA CACCATCTTT
GTTCCTGTCA TAGGTCCTGT TGCGGCAGTG GCCGGTGAGG TGAAGCGACC GGGGATCTAC
GAATTAAAGG GGACGACGAA CCTCGTTCAG ATTCTGCAGC TCGCGGGCGG CATTGCCGCG
TCCGGGGATA CCGGCAGGCT GCAGGTAGAA AGGATTGAAG GGAATAGCGC GCGTATTGTC
CTCGATTATG AGCCGAAGGC CGGCCAGATG GAGGAAACTC TTGCAAAGGT GACGATCCAC
GACAGGGACA TGATAAAGGT CTTTCCCGTG TTCAAGGCAT TACGTCATGT GGTCGGCCTC
AAGGGCAATG TGGCGCGGCC GGGCGAATAC CAGTACAAAG ACGGGATGCG AGTAACCGAT
ATAATCCCTT CTTACACGGC GCTCTTGCCG GATTCCTATC TGGAGTCCGC GGAAATCTCC
CGCCTGGTTT CCCCGGATTT CCACAAGGAA ACATTGTCGA TCAATCTGCG AAAAGCCATG
GAAGGCGACC AGAAGGAAAA TATCCTGCTT CAGGAGCAGG ATACCATCAG GGTGTTTTCC
CGCGGAGAGA TGGAAGAAAA GCCGGTGGTT TCAATCAATG GACAGGTGCT GAATCCAGGC
ACCTATGATT ACTACCAGCG AATGACGGTG CGTGATCTGG TGACCGCTGC CGGCAGCCTC
AAGAGAAACG CATATCTCGA TAACGCCGAG TTGACACGTA TTGATGTGGT GCATGGCAAA
GCCAATTCAA TACGCGTGGA TATCAACCTG AAAAAAGCTA TGTCAGGAGA TCCGGAACAG
AATATTCAGC TTCAGCCCGA TGATGTGCTT ATTGTGCGCG GCGTTGTCGA GTGGCTTGAT
GCAACCGATA GATTCGTTAC CCTTAAGGGC GAAGTCAGGT TTCCCGGCAT TTATTCCATT
GCCAAGGGCG AGAAGCTTGA TTCCGTGATT TCCAGGGCCG GTGGTTTTAC CGACAAGGCA
TATCTGAAAG GGGCCAAGTT CACAAGGAAA TCGGTTCAGG AAAGCCAGCA AAAGCGGATG
GATGAAGTTA TTTCCCGCAG CGAACAGGAC ATCTTGAAGA AGCAGGGTGA ACTCGCTTCG
CTGGCTTCTT CAAAGGAAGA GCTTGAAGCA ACCAAGGCGT CGTTGGAAGG GTTGATGAAA
GGGTTGGAGA AATTGAAGTT GGTAAAAGCC GAAGGTCGGG TGGTAGTGCG TCTTGTACCT
TTAAACGAGC TGCAAAAGAG CCCCTATGAC CTTGAAATGA TGGGGGGCGA TATCCTCGAC
ATACCACAAA CGCCCAATGT TATCAATGTT ATGGGGCAGG TATACAATCC GACGACTTTT
GTCCATATGG CCGGCGGCAG TATTGCATCC TATTTGAAAA ATGCCGGCGG ACCGACCAGG
GATGCAGAGG AAGACGAAAT GTACATCATC AAGGCGGATG GTTCCGTAGA CAGTCGCCAA
CAGGCCACGT TCGGCATCCA TTGGGACGAA GTGTCGAAGA GCTGGACCTT CGGCAGCTTC
ATGTCCAAGA CCATGGATCC GGGTGACACC CTGGTGGTGC CGCAGAAACT GGAGCGCACG
GCATGGCTGA GGGAAATCAA GGATATAACC ACCATCCTGT CGCAGGTGGC GCTGACCGCA
GCCACCGTCT TCATCGGACT CAAATAA
 
Protein sequence
MKKILSNSML TLLLLLVFVP AFALTADEMK KIEEQQGMYT GAEKQGIPLG ELNELKGRST 
SDITKGVPSD KDKSLTEPPG NKARIFMKTE PGDGLISLSW DIKGLQQKPG DPPLKYTLFY
GTESGSYDKK LDVGNVRDYK LRGLKNHQVY YIKIQGSTSI QVKQETDEAE PKAVDLVLFS
REMTAIPLPT EEQGSQLEKS FARNVTTLQD NLEADPFKRD LKQFGYDFFK NSLSTGILTD
NLPVGGDYII GPGDSLRIDL WGSVQARYDA TVDRNGEISL PKVGTVKVWG ISYAQAKDVI
NKAISRYFKG YELNVTLGRL RTIQVFVVGE VESPGTYSVS SLATVINALS QAGGPSLNGS
LRTIRLLRGG KVVQEIDLYD MLLGGDRSKD LRLENGDTIF VPVIGPVAAV AGEVKRPGIY
ELKGTTNLVQ ILQLAGGIAA SGDTGRLQVE RIEGNSARIV LDYEPKAGQM EETLAKVTIH
DRDMIKVFPV FKALRHVVGL KGNVARPGEY QYKDGMRVTD IIPSYTALLP DSYLESAEIS
RLVSPDFHKE TLSINLRKAM EGDQKENILL QEQDTIRVFS RGEMEEKPVV SINGQVLNPG
TYDYYQRMTV RDLVTAAGSL KRNAYLDNAE LTRIDVVHGK ANSIRVDINL KKAMSGDPEQ
NIQLQPDDVL IVRGVVEWLD ATDRFVTLKG EVRFPGIYSI AKGEKLDSVI SRAGGFTDKA
YLKGAKFTRK SVQESQQKRM DEVISRSEQD ILKKQGELAS LASSKEELEA TKASLEGLMK
GLEKLKLVKA EGRVVVRLVP LNELQKSPYD LEMMGGDILD IPQTPNVINV MGQVYNPTTF
VHMAGGSIAS YLKNAGGPTR DAEEDEMYII KADGSVDSRQ QATFGIHWDE VSKSWTFGSF
MSKTMDPGDT LVVPQKLERT AWLREIKDIT TILSQVALTA ATVFIGLK