Gene Gura_2048 details

Gene Information

Locus tag: Gura_2048
Symbol:
ID: 5163087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 2396175
End bp: 2397896
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 51%
IMG OID: 640549543
Product: TPR repeat-containing protein
Protein accession: YP_001230811
Protein GI: 148264105
COG category: [N] Cell motility
    [R] General function prediction only
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
    [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
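
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 2396175
end_bp = 2397896

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1722 573
```

Under these assumptions the result agrees with the listed 1722 bp and 573 aa.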


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.732994
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA AGTATTGTGT TGTCAACCTC TGGTTCTTTG TGGTTCTCTC AGGATGCGCG 
ACCGGCCATG TAGCCGAGCC GCCTCTTCTT CAGGCTCAAG CCCTCCATCC CGACGTTAAT
ATCGCTGAAT CACGCTCCAT GTATATCTAT TCGCTCTCCC GTATCCATGT TCTGGAAGGT
GATTTCGATG GGGCTTTATC CCTCCTCCAG GCGGCGGTTG AGGCAGATCC CAAATCCGCT
TTTCTTCGCA AATCCATTGC TCAGGTCTAT TTACAGATGA ACAGGTTCCA GGACGCGCTG
GAATCTTGTC AAACCGCCAT CAAACTTGAT CCCGGCTTTG TCGAGGCACA GATCCTGGCA
GGAAATATTC TGGTCGGTCT GCAGCGGGAT AAAGAGGCCA TTCCTTACTA CAAAAAGGCC
CTTGAAATTG ACCCGTCCAA GGAAGACATC TACCTCCACC TGGCCATCGC CTACGTGAAG
GGGTTTGAAT ACGAGGAGGC TGTCAACACC CTTAAGGTGC TTCTCAAGGT CAATCCCGAT
TCCGCCATCG GTTATTACTA TCTGGGGAAG ACTTACGATC AGATGAAGCT TTCCAAGGAT
GCCGCCAACT ACTATAAAAA GGCGGTGGAG CTGAAGCCGG ATTTTGAGCA GGCTATCATT
GATCTGGGGA TTTCCCAGGA AATGCAGGGG CTCGCCGGTG AGGCAATTAA TACCTACAAT
GAATTGTTGC GGATCAATCC GGTTAATTAC AATGTCATTC AACATCTGGT TCAGCTGTAT
ATCCAGCAGA AGCGCCTTAA TGACGCCCTT ACGTTGTTGA AAAATATGGC CGACAGCGGT
ATCGGCGGAC AGGAAACACA CCGTAAGATC GGCCTCATTT ATCTGGAGAT GGAGCGTTAC
GACGACGCAA TCAAGGAATT TACCGAGATA CTCGGGCAGG AGCCGGACGC TCAGCAGGTT
CGATATTATC TGGCATCCAC TTATGAGGAG ATGGAAGATT TCGACCGTGC CATCGAAGAA
TTCAAGAAGA TCCCCCCGTC ATCAGCCCAT TATTTTGATG CTTTGGGACA TCTCGGCTTC
CTTTATAAGG AAAACGGAGA GCCGGAGAAG GGGATTGCAC TTCTTAAGGA AGCAATCACC
AACCAGCCGA ATCGAATCGA ACTTTATCTG AATCTTGCCG GACTCTATGA ATCGATGGAT
CAATTTGCCG AGGGGCTCCG GGTGTTGACG GATGTGGAAG GGAATTTCCC CAATGATCCT
CGGCTGAGCT TCCGCATGGG CGTTCTTTAT GACAAGATGG GTAACAAGGA CGAATCTATT
GCCCGGATGA AAAAGGTCAT TGCCCTGGCG CCGAACGATG CGCAGGCATT GAACTATCTC
GGCTATACCT ACGCAGAGCT TGGCGTCAAT CTGGATGAGG CGTTGCAGTA TCTGAACAAG
GCCGTTTTGC TCCGCCCGGA TGACGGCTTC ATTCTGGACA GCCTCGGTTG GGCCTATTAC
AAAATGAAGC GCTACGACCA GGCGGTGTTC CATCTGGAAC GGGCCGTCCA GCTGGTTGAC
GAGGACGCCA CCATAATTGG TCACCTGGCC GATGCATATT TTGCCAACAG GGAATACCGC
AAGGCGCTTA CACGTTATCG CCGCGTCCTG CAGCTGGAGC CTGAGCGCAA GGACATCGCC
GAGAAGATAA AGAAGATCAT GGCGGAGACC GGTGAAAAAT GA
 
Protein sequence
MKKKYCVVNL WFFVVLSGCA TGHVAEPPLL QAQALHPDVN IAESRSMYIY SLSRIHVLEG 
DFDGALSLLQ AAVEADPKSA FLRKSIAQVY LQMNRFQDAL ESCQTAIKLD PGFVEAQILA
GNILVGLQRD KEAIPYYKKA LEIDPSKEDI YLHLAIAYVK GFEYEEAVNT LKVLLKVNPD
SAIGYYYLGK TYDQMKLSKD AANYYKKAVE LKPDFEQAII DLGISQEMQG LAGEAINTYN
ELLRINPVNY NVIQHLVQLY IQQKRLNDAL TLLKNMADSG IGGQETHRKI GLIYLEMERY
DDAIKEFTEI LGQEPDAQQV RYYLASTYEE MEDFDRAIEE FKKIPPSSAH YFDALGHLGF
LYKENGEPEK GIALLKEAIT NQPNRIELYL NLAGLYESMD QFAEGLRVLT DVEGNFPNDP
RLSFRMGVLY DKMGNKDESI ARMKKVIALA PNDAQALNYL GYTYAELGVN LDEALQYLNK
AVLLRPDDGF ILDSLGWAYY KMKRYDQAVF HLERAVQLVD EDATIIGHLA DAYFANREYR
KALTRYRRVL QLEPERKDIA EKIKKIMAET GEK
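
Both sequences above are printed in space-separated blocks of 10 residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among all bases; returns 0.0 for empty input."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAAAGA AGTATTGTGT TGTCAACCTC TGGTTCTTTG TGGTTCTCTC AGGATGCGCG"
seq = clean_sequence(first_line)

print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions should give a value comparable to the listed GC content, though how the database rounds that figure is not stated in the record.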