Gene Gura_3898 details

Gene Information

Locus tag: Gura_3898
Symbol:
ID: 5166923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 4548131
End bp: 4550077
Gene Length: 1947 bp
Protein Length: 648 aa
Translation table: 11
GC content: 54%
IMG OID: 640551380
Product: type II secretion system protein E
Protein accession: YP_001232620
Protein GI: 148265914
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB
TIGRFAM ID:
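
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the record; the coordinate convention is an assumption.
start_bp = 4548131
end_bp = 4550077

# With 1-based inclusive coordinates, the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the results agree with the Gene Length (1947 bp) and Protein Length (648 aa) fields.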


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000434114
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTAACG ACGCGATTAT CCCGAATGGC AAATCATTAC AGGACAAGGC ATTGCTGACG 
TCCTCCGGCG TAGCCGTGAA AGATGCTGAC TCAGGCAGCG AGATTGCCGC TCTGCTTGCC
AAGGAAGGGT TTCTTTCTCC TCAAAATCTT GCCCATGCCA AAAGGGTCAA ATCCAAGCTA
TCTTCGCCGA AAACATTGAC ATCAGTGCTT CAGGAGCTGG GTTTCATCAG CAAAGAGCAA
CTTCGCGACG CCCTGCTGAA AAACCTGGTC TCGGTCAGAA TCGGAGATCT TCTAGTCGAA
CTGGGTCATT TGAAACCGGC GGATCTGCAA GCGGCCCTCG GCATCCAGAA AGAATCGAAC
GGCGCAAAGA GACTCGGCGA GGTCCTCGTT GACAACCGCT TCATCGACGA ATTGACGTTC
GCCGAAACCC TCGCCTTTCA GCTCGGCTTC CCATGCCTCG ATGTGGACAT AGCGGCCATC
GACCGCTCCA TCCTCTCCAG AGTGCCGCTT CAGACGCTCT CCGACCACAA TTTCATACCG
ATAACGGCAA AGGACGGAAA GGTGCTGGTG GCATTTGCCG ACCCACTGGA CGCGCAGGAC
CGGGCGGTCG CGGAAAAGAT CTTCGGCAAC TCCATGGATT TCGCCATATC GACACGCAAG
GGGATCCGCG AGGCCATCGC CTTCTTCAAA CGGAGCGGCA CACGGACTGA CACCACGGCT
GTCGACGAAA ATACCATCAT GGGGATCGTC AATGCGTTAT TCGAGGAGGC GGTCAAGGAA
GCGGTCAGCG ATATCCACAT CGAGCCGATG AAGGACCGGC TTCGCGTCCG TTTTCGCCGC
GACGGCGTCT TACTGCTCCA TAAGGATTTT CCCAAGGAGC TGGCCCCGCC GATCAGCAGT
CGGATCAAGA TCCTTGCCGA GGCTGATATT GCCGAAAGAA GACGCCATCA GGACGGCAGA
ATCCTCTACG AGAGCGACCA GAACGGTTTC ACCCTCGACC TGCGCGTTTC GTTCTACGTC
ACCATCTATG GCGAAAAAAT AGTGCTGCGG CTTTTAAATA AAAAGGGCGA ACTCCTGGAC
ATCAAAGATA TCGGCATGCC GCCACGCATG CTGGAACGGT TCCTGGACGA TGCGGTTGAT
ACGCCGAGCG GCGTGCTCAT CATCACCGGC CCTACCGGTT CCGGCAAGAC CTCGACCCTC
TACAGTTGTG TTCACCACAT GAACAACCTC AACACATCCA TCATCACCGC GGAAGACCCG
GTAGAGTACA TCATAGACGG CATCTCCCAG TGCTCCATCA ACACAAAAAT AGGCGTCACC
TTTGAAGAAA CGCTTCGCCA CATTGTACGT CAGGACCCGG ACATCATCGT ACTCGGCGAA
ATCCGTGACA CCTTTTCAGC GGAAACGGCC ATCCAGGCCG CACTCACCGG CCACAAGGTC
CTTACCACCT TCCACACCGA AGACAGTATC GGAGGACTTC TCCGGCTGAT GAACATGAAT
ATAGAAGCGT TTCTCATCTC CTCAACGGTA GTCTGCGTTC TGGCGCAAAG ACTGCTCCGG
AAAGTCTGTC CACACTGCGC CGAACCGTAC ATCCCGACCC CGACCGAACT CCGCCGCCTT
GGTTACGGCA ATGAAGAACT GAAAGGTAAT GAGTTCAAGA TCGGGCGGGG CTGCAACCAC
TGCCGGTTCA GCGGCTATCG CGGTCGAGTC GGAATTTTTG AAATGTTGGT ATTAAACGAA
ATGGTCAAAG ACGCTATTCT CAGTAAAAAA ACGTCCTACG AAATCAGACG TATCAGCACT
GAAACTTCGG GCCTCGTCAC ACTCATGGAA TCGGGTTTGT CAAAGGCGGC AAAGGGATTG
GTTTCCCTTC CTGACGCCAT CAGGATGCTG CCCCGATTGG GAAAACCGCG ACCGCTGAAT
GAAATTCGCA GACTGCTGGG AGAATAA
 
Protein sequence
MANDAIIPNG KSLQDKALLT SSGVAVKDAD SGSEIAALLA KEGFLSPQNL AHAKRVKSKL 
SSPKTLTSVL QELGFISKEQ LRDALLKNLV SVRIGDLLVE LGHLKPADLQ AALGIQKESN
GAKRLGEVLV DNRFIDELTF AETLAFQLGF PCLDVDIAAI DRSILSRVPL QTLSDHNFIP
ITAKDGKVLV AFADPLDAQD RAVAEKIFGN SMDFAISTRK GIREAIAFFK RSGTRTDTTA
VDENTIMGIV NALFEEAVKE AVSDIHIEPM KDRLRVRFRR DGVLLLHKDF PKELAPPISS
RIKILAEADI AERRRHQDGR ILYESDQNGF TLDLRVSFYV TIYGEKIVLR LLNKKGELLD
IKDIGMPPRM LERFLDDAVD TPSGVLIITG PTGSGKTSTL YSCVHHMNNL NTSIITAEDP
VEYIIDGISQ CSINTKIGVT FEETLRHIVR QDPDIIVLGE IRDTFSAETA IQAALTGHKV
LTTFHTEDSI GGLLRLMNMN IEAFLISSTV VCVLAQRLLR KVCPHCAEPY IPTPTELRRL
GYGNEELKGN EFKIGRGCNH CRFSGYRGRV GIFEMLVLNE MVKDAILSKK TSYEIRRIST
ETSGLVTLME SGLSKAAKGL VSLPDAIRML PRLGKPRPLN EIRRLLGE
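
The gene and protein sequences above can be checked against each other with a small translation routine. This is a minimal sketch, not the database's own method. It assumes the sequence is in frame from its first base, uses the standard codon assignments (which translation table 11 shares for internal codons), and ignores table 11's alternative start codons, which do not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Codon table built from the compact amino-acid string in TCAG base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks and the final line of the gene sequence in this record.
first_blocks = "ATGGCTAACG ACGCGATTAT CCCGAATGGC"
last_line = "GAAATTCGCA GACTGCTGGG AGAATAA"

print(translate(first_blocks))  # MANDAIIPNG
print(translate(last_line))     # EIRRLLGE
```

The two fragments translate to the first ten and last eight residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_percent` gives a figure to compare with the GC content field.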