Gene Information |
Locus tag | Gura_0162 |
Symbol | |
ID | 5163219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 221951 |
End bp | 225001 |
Gene Length | 3051 bp |
Protein Length | 1016 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640547661 |
Product | hypothetical protein |
Protein accession | YP_001228951 |
Protein GI | 148262245 |
COG category | |
COG ID | |
TIGRFAM ID | |
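The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table of this record.
START_BP = 221951
END_BP = 225001


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3051 1016
```

Under these assumptions the computed values match the Gene Length (3051 bp) and Protein Length (1016 aa) fields listed above.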
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATGAGG TCATTGGTGT ACCGCATGCC TCGGCCATGA GGATCGGGTT CAACAAATTG AACCTATCCG GTTTTTTCTG CATAGACACC ATTCCCACCA TTGCATTTCT TTCTCAAGAA CGAATCAATC GACTCGAAAT AACCAGGGTA CACAATGCCC TCTGGAACCA GGGACTGGTC AGCCTCCTTT TGGTCACGCT CCCCGACGAG GTCCGTGCTT ACTCTCTGGC CAGGAAACCA ACCGCAGCGG ACGAAGACCT CGACGACGAC AAAAAAGACA ACCGTCTTAT TGATTGCCTT GATCTATTGA AAGATGCGCT TGAAATCTCG CATCTGATTA CCGGGGTTGA ATCCGGCCGA TATTTTCAAA AGAATAAAGA GTACTTTGCA CAAAATGAAA AGATCGATGC ATTACTTCTG TCAAATTTAA GGGCCACGGA AAACGAGCTG ACCTCTGCGC CACTCAATCT GTCAACCGAA TCCGCTCAGG CACTGCTGCT GCAGATTACC TTTATTGCCT ATCTCGAAGA CCGGGGGATT ATCGATCCTG ACTATTTCCG GGAGGCATTG AAAGGCAAGG GGATATCTAC TTTAGAACAG TTGCTGGACG AAAATGATCC TGAAAACCTG AATTCGCTTT TCGAAAAGCT CCATGGAAAT TTTAACGGCG ACATATTCTT TGCGCCATGT GCATTTGACG GCGAAGCAAA GGCACCAGTA CTCAATGCGG GCCATCTGCG AAGTCTTGCG GAATTCAGAA AAGGGGGTGT CGATAAGACT ACCGGCCAAG GTCGTTTCTG GCCGTATAAC TTCAAATATA TCCCCGTTGA GTTGATAAGC GCCATTTACA ACCGCTTCCT TGGCGACAGG CCGGTGGAGC GCAAAGTGAG CGGAGCCTTT TATACTCCTC ACTTCCTCGC CGATTTAACG GTCAACCAAT TGTGGGAAGA ATTGACGCCG GCAATCCGAT CAAGTCAAGA CTTCACGGTG CTGGATCCGG CCTGCGGCTC TGCGATATTC CTTGTCAGAA TTTTTCAACG AATGGTGGAG GATTGGCGTT TTCTTCATCC CGGCGGGACT CCCGACTGGG ATACTCTGGT TGCCATCGTC GAAAGACTCA ACGGTTGGGA CAAAGAAACC AGCGCTGTAC GCATCGGCAT TTTCTCCCTC TATATTGCTT TGCTTGAAGA AGTCGAACCA GCTGCAATTC TCAAACTGTT GGCGGAACGG AAATTGTTGC CCCCACTTTT CAGGAAGACG ATGTGCGACA GGGATTTCTT TGGCAAAGAT ACGCCAAATA CCAAATTCGA TTTGGTATTT GGCAATCCTC CATGGGTAAG CCGTAAGGAA GATCAGGTCG TTTCTGCCAC TGAATGGTGC AAAGCACACG AACTACCGAT GCCGGCGAAA GAACTGGCCT GGGCCTTTGT CTGGAAAAGC ATCCAACATA CTAAATCAGA GGGGATGATC GGCTTACTGC TGCCGGCCAT GGGCGTTTTA CTCAACCATA GCGAACCTTC CATTCAAGCC AGGGGGCTGT GGCTGAAACA GGTACTTCTG AGTAAAGTCA TCAATTTTTC TGATATCTGC TTCCTTCTGT TCGACGGCGC CAAGCGACCG ACGGCTTTGT GTATCTTTCG ACCGTCCGAC AAAAAGCTTA GCGATTATCG GTTCGACTAC TGGTGCCCTA AAGCCGACCC GTTGCTGCAA CAAACCCGCA TGCTTACGTT GAACAGAGGC GATAAGCTAT CGTTTAAACT ATCCACGGTG CTCCATGATC CGGGCTCTTG GGGGCGGCAT 
CTCTGGATGA CAAATCGTGA CATGAAACTG CTCGGCTGGA TTGGCGGGTT GTCTCGTCTG GAGAGAAAAT TAGCCACCTA TAAAGAATCA CAGCAGAAAG AGTTCGATAA GAAAACAAAA GTTTGGATAA TTGGACAAGG ATTCCAGCCT TATAATAGTG AAAGAAGCAA CAAGAATACT AAACCGAAAA AATCTAACCG GGTCGAAAAA GTTCCATTTT TGGATGCAAA TAAGTTTAAC GAATATATTA TCCCTACGAT TTTAATTGAC AAGCCTTGGC ATACTTCCTT AGTTCGTAGA CTTGGGAATC AAGATGGTTT TATTGGACCG CATGTCCTTA TACCAAAAGG CGTACGAAGG AAAACTGGAT TATTGAGAGC GGCATACGTA GAGCATGATT TATGTTTTAC AGATGCAATA CAGGCGATCA ATTTTCCTGA GACAGATATT CAAAGACTTA AACTGCTAAC GGTTATACTG AATAGTCATT TTGCAGCATG GTTCTATTTT CATGAAACCG CTAGCCTTGG AGCTGACCGT GCCTTGGTTC ATGAAGAACA GCTATTGGCG CTACCCTTTC CAGAAATAGA CGAGTTGCCT GATCCAAAAG CGGCCAGAAA TGCTGCGGAT AAGATTGTCA TGATTTTTGA TGATCTGCTT TTGCATAAAG ACACATATTC ACAAGGCCAA TTCCCCGACA ACGAAACAAT TGAAGAGGTT AACCGGCTCG TCTATCAATA CTATGGCCTC ACTGAATCGG AATTCATACT CATAGAAGAC ACCCTCAACT ATATTCTGCC AAGCATTCAG CCTCGAAACA AAAGCTTCCC GCATCTTTGG AACAAAGCCG GCAAGAAGCA ATGGCAAGAA TACATGATCA CTCTCTTGTC CGCTCTTGAA GAATGGCTGG ACAACGGGAG CCACCTGTCC GCTACATTGA TTACCGACAA CCCAGACGTG CTGCTGCTCG GCCTGAAAAT CGAGCAAAGC CGCCCCAAGC AGTCCATTAC CTTCTGTGAA CAGGGGGGCG ATTTCAACAA AACGCTTTCA AAAATCAATG AAGAGCTTAA ACAGCAGGTG TCGAGAAACA TTCAGCTCAT GCCCGATCTC CGCATTTTCA TCGCCGACAC CCTCTACCTG TTAAAGCCGC GCACCATGCG CTATTGGCTT AAGAGCACGG CTCTGAACGA TGCTGATGCA ATTATTGCCG ATCTTCAAAT CCAGAAATTC CATCATGGGT ACAAAGGATA G
|
Protein sequence | MDEVIGVPHA SAMRIGFNKL NLSGFFCIDT IPTIAFLSQE RINRLEITRV HNALWNQGLV SLLLVTLPDE VRAYSLARKP TAADEDLDDD KKDNRLIDCL DLLKDALEIS HLITGVESGR YFQKNKEYFA QNEKIDALLL SNLRATENEL TSAPLNLSTE SAQALLLQIT FIAYLEDRGI IDPDYFREAL KGKGISTLEQ LLDENDPENL NSLFEKLHGN FNGDIFFAPC AFDGEAKAPV LNAGHLRSLA EFRKGGVDKT TGQGRFWPYN FKYIPVELIS AIYNRFLGDR PVERKVSGAF YTPHFLADLT VNQLWEELTP AIRSSQDFTV LDPACGSAIF LVRIFQRMVE DWRFLHPGGT PDWDTLVAIV ERLNGWDKET SAVRIGIFSL YIALLEEVEP AAILKLLAER KLLPPLFRKT MCDRDFFGKD TPNTKFDLVF GNPPWVSRKE DQVVSATEWC KAHELPMPAK ELAWAFVWKS IQHTKSEGMI GLLLPAMGVL LNHSEPSIQA RGLWLKQVLL SKVINFSDIC FLLFDGAKRP TALCIFRPSD KKLSDYRFDY WCPKADPLLQ QTRMLTLNRG DKLSFKLSTV LHDPGSWGRH LWMTNRDMKL LGWIGGLSRL ERKLATYKES QQKEFDKKTK VWIIGQGFQP YNSERSNKNT KPKKSNRVEK VPFLDANKFN EYIIPTILID KPWHTSLVRR LGNQDGFIGP HVLIPKGVRR KTGLLRAAYV EHDLCFTDAI QAINFPETDI QRLKLLTVIL NSHFAAWFYF HETASLGADR ALVHEEQLLA LPFPEIDELP DPKAARNAAD KIVMIFDDLL LHKDTYSQGQ FPDNETIEEV NRLVYQYYGL TESEFILIED TLNYILPSIQ PRNKSFPHLW NKAGKKQWQE YMITLLSALE EWLDNGSHLS ATLITDNPDV LLLGLKIEQS RPKQSITFCE QGGDFNKTLS KINEELKQQV SRNIQLMPDL RIFIADTLYL LKPRTMRYWL KSTALNDADA IIADLQIQKF HHGYKG
|