Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3898 |
Symbol | |
ID | 5166923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 4548131 |
End bp | 4550077 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640551380 |
Product | type II secretion system protein E |
Protein accession | YP_001232620 |
Protein GI | 148265914 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000434114 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTAACG ACGCGATTAT CCCGAATGGC AAATCATTAC AGGACAAGGC ATTGCTGACG TCCTCCGGCG TAGCCGTGAA AGATGCTGAC TCAGGCAGCG AGATTGCCGC TCTGCTTGCC AAGGAAGGGT TTCTTTCTCC TCAAAATCTT GCCCATGCCA AAAGGGTCAA ATCCAAGCTA TCTTCGCCGA AAACATTGAC ATCAGTGCTT CAGGAGCTGG GTTTCATCAG CAAAGAGCAA CTTCGCGACG CCCTGCTGAA AAACCTGGTC TCGGTCAGAA TCGGAGATCT TCTAGTCGAA CTGGGTCATT TGAAACCGGC GGATCTGCAA GCGGCCCTCG GCATCCAGAA AGAATCGAAC GGCGCAAAGA GACTCGGCGA GGTCCTCGTT GACAACCGCT TCATCGACGA ATTGACGTTC GCCGAAACCC TCGCCTTTCA GCTCGGCTTC CCATGCCTCG ATGTGGACAT AGCGGCCATC GACCGCTCCA TCCTCTCCAG AGTGCCGCTT CAGACGCTCT CCGACCACAA TTTCATACCG ATAACGGCAA AGGACGGAAA GGTGCTGGTG GCATTTGCCG ACCCACTGGA CGCGCAGGAC CGGGCGGTCG CGGAAAAGAT CTTCGGCAAC TCCATGGATT TCGCCATATC GACACGCAAG GGGATCCGCG AGGCCATCGC CTTCTTCAAA CGGAGCGGCA CACGGACTGA CACCACGGCT GTCGACGAAA ATACCATCAT GGGGATCGTC AATGCGTTAT TCGAGGAGGC GGTCAAGGAA GCGGTCAGCG ATATCCACAT CGAGCCGATG AAGGACCGGC TTCGCGTCCG TTTTCGCCGC GACGGCGTCT TACTGCTCCA TAAGGATTTT CCCAAGGAGC TGGCCCCGCC GATCAGCAGT CGGATCAAGA TCCTTGCCGA GGCTGATATT GCCGAAAGAA GACGCCATCA GGACGGCAGA ATCCTCTACG AGAGCGACCA GAACGGTTTC ACCCTCGACC TGCGCGTTTC GTTCTACGTC ACCATCTATG GCGAAAAAAT AGTGCTGCGG CTTTTAAATA AAAAGGGCGA ACTCCTGGAC ATCAAAGATA TCGGCATGCC GCCACGCATG CTGGAACGGT TCCTGGACGA TGCGGTTGAT ACGCCGAGCG GCGTGCTCAT CATCACCGGC CCTACCGGTT CCGGCAAGAC CTCGACCCTC TACAGTTGTG TTCACCACAT GAACAACCTC AACACATCCA TCATCACCGC GGAAGACCCG GTAGAGTACA TCATAGACGG CATCTCCCAG TGCTCCATCA ACACAAAAAT AGGCGTCACC TTTGAAGAAA CGCTTCGCCA CATTGTACGT CAGGACCCGG ACATCATCGT ACTCGGCGAA ATCCGTGACA CCTTTTCAGC GGAAACGGCC ATCCAGGCCG CACTCACCGG CCACAAGGTC CTTACCACCT TCCACACCGA AGACAGTATC GGAGGACTTC TCCGGCTGAT GAACATGAAT ATAGAAGCGT TTCTCATCTC CTCAACGGTA GTCTGCGTTC TGGCGCAAAG ACTGCTCCGG AAAGTCTGTC CACACTGCGC CGAACCGTAC ATCCCGACCC CGACCGAACT CCGCCGCCTT GGTTACGGCA ATGAAGAACT GAAAGGTAAT GAGTTCAAGA TCGGGCGGGG CTGCAACCAC TGCCGGTTCA GCGGCTATCG CGGTCGAGTC GGAATTTTTG AAATGTTGGT ATTAAACGAA ATGGTCAAAG ACGCTATTCT CAGTAAAAAA ACGTCCTACG AAATCAGACG TATCAGCACT GAAACTTCGG GCCTCGTCAC ACTCATGGAA TCGGGTTTGT CAAAGGCGGC AAAGGGATTG GTTTCCCTTC CTGACGCCAT CAGGATGCTG CCCCGATTGG GAAAACCGCG ACCGCTGAAT GAAATTCGCA GACTGCTGGG AGAATAA
|
Protein sequence | MANDAIIPNG KSLQDKALLT SSGVAVKDAD SGSEIAALLA KEGFLSPQNL AHAKRVKSKL SSPKTLTSVL QELGFISKEQ LRDALLKNLV SVRIGDLLVE LGHLKPADLQ AALGIQKESN GAKRLGEVLV DNRFIDELTF AETLAFQLGF PCLDVDIAAI DRSILSRVPL QTLSDHNFIP ITAKDGKVLV AFADPLDAQD RAVAEKIFGN SMDFAISTRK GIREAIAFFK RSGTRTDTTA VDENTIMGIV NALFEEAVKE AVSDIHIEPM KDRLRVRFRR DGVLLLHKDF PKELAPPISS RIKILAEADI AERRRHQDGR ILYESDQNGF TLDLRVSFYV TIYGEKIVLR LLNKKGELLD IKDIGMPPRM LERFLDDAVD TPSGVLIITG PTGSGKTSTL YSCVHHMNNL NTSIITAEDP VEYIIDGISQ CSINTKIGVT FEETLRHIVR QDPDIIVLGE IRDTFSAETA IQAALTGHKV LTTFHTEDSI GGLLRLMNMN IEAFLISSTV VCVLAQRLLR KVCPHCAEPY IPTPTELRRL GYGNEELKGN EFKIGRGCNH CRFSGYRGRV GIFEMLVLNE MVKDAILSKK TSYEIRRIST ETSGLVTLME SGLSKAAKGL VSLPDAIRML PRLGKPRPLN EIRRLLGE
|
| |