Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gura_3334 |
Symbol | |
ID | 5166740 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 3918757 |
End bp | 3920544 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640550820 |
Product | hypothetical protein |
Protein accession | YP_001232064 |
Protein GI | 148265358 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000666243 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGC AACTTTGGAA GATGCTCCTT CTCACCTCGT TCATCCTGAG CGCCCTGGGG ATTTCGGGAT GTGGCGGCGG AGGGAGTACG GCAACGACAT CGGGAACCGC AATAACAGTA AGCGGTACCG CACTGGCAGG GGCGCCCCTT TTCGGGAACG CCTGGATCAA GGATGCGAAC GGCAAGAAAA AGGGGCCGGT TGCCATCGAC AAAAATGGAA ATTTCGCCTT CACCAACATG ACCGGCATGC AGGCCCCCTT CATCCTCCAG GCCGACGGCA CCGCCGGCAC CAACAGCTAC CGCCTCTGTT CCATGGCCAC CGGCGGCGGA ACCGCCAATA TCAACCCCAT GACCAACCTT GCCGTCGCCG CGGTGACCGG CAAAGACCCG GCTGCGGTCT TCACCGACCC GACGGCCAAC AACATCAAGA ACTCCATCAA CGACACGGCC GTGGCAGCCG CCATCGCACA GATCAAGGCG ATGCTCAAGC CGATCCTCGA TGCGTACAAC GCCTCTGCCG TCGACCCGCT GAAGGGTAAC GTAGCAGCCA CCAACTCCGG GTTGGACGGC GTCTTCGACG TGGTGAAGAT CAACGTCACT CCCGACAACA CCGGTGCGGC GCAGGTATCC GTCAGCAACA ACCTTACCAA CTCGACCATA TTGCAGACGC AAACGGTCTC CACGGCGGTC GCCAACACCA CACAGACGGC GGCGACAATC ACAACCCAGA CGACCGGCTT GTCCACCGAC GCCGCCAACC TGCAGGCAAT TACCGCGCAA CTTCAATTGC TGGCAACCGA GCTGGGCAAG ACCACCCCGA ACCTCGACCC CTTCTTTGCC ACCAATTTCG GGATCAACAG CGGGCTCGAC CGGGCCCAAT CCATCCTGCA ATTGGCCCCC CCGGGGAAAA TCACCGGCAT CTCCCCGATC AGCGTCGTGC AGAAAACCGC AAACGGCGCG AGCTTTGACT ACGAAGTCTC CTTCCTGGCT TACTTCGCCG ACGGCTCAAA CGGCGCCCCC GATGACAACT TCATCTTCAC CAATGAAGGC GGCGCCTGGA GGCTGAAAGG GAATAACTAT AAATCCTACG TCAGGATTCA ACCGCAAGCC TACAGATGGA TCGATGCGGT CGGCACGACG ACCGTCAAGA CCGGTCTCGA CGTCGAAGCC GAAGACCCGG GCGTGATCGG CATCGCCACC ATTACGATTA CCGGCCCCGG TCTCCCCGCC GCCGGACTCA CCATGACATC GGTCGGTGTG GGTGTCACCT ATTTCAATAT CATCCAGGCT CAACAGGACC CCACCCTGAA CACGCTGACC AACCAATGGA ATTTCCTCCC GCTGAGTGAT GCGACGATCA CTGGTACCTT TGCCGCCACG GCTGCGCCGT TCACCTACAC CTTTACGGTC AAGGATGCGA ACGGCGCCAC CATCGAGACC AGGACAAAGA AGCTCGCAGT GGGACCGCTC CTCTCTGCCA CACTTGACGC CACCTACTTC CCGACCATCA GTGGCCTGGC TTCCCAAGCC ATGTCCCTCT TGACCGGCAA GAGCTCGATT TCCTTCTTCT TTGCCAAGCC GACCGCCTAC ACGGTCCAGG AGCAGCGAGC CAATCTCAGT TTCTGGAACG AGACCAGCAA CGGATATTAC GATACCGAGC CTCTCCTTAC CGACACCCAG GCCACCATCA CAGGCGGTAT CCCCTCGACG CTCCAGGGGG CATGGCTCAG CATGGGAGCA AGGGATGGCA GCGGCCGGAG ATTTGATGCC GTCTTGATTT TCAAGTAG
|
Protein sequence | MKQQLWKMLL LTSFILSALG ISGCGGGGST ATTSGTAITV SGTALAGAPL FGNAWIKDAN GKKKGPVAID KNGNFAFTNM TGMQAPFILQ ADGTAGTNSY RLCSMATGGG TANINPMTNL AVAAVTGKDP AAVFTDPTAN NIKNSINDTA VAAAIAQIKA MLKPILDAYN ASAVDPLKGN VAATNSGLDG VFDVVKINVT PDNTGAAQVS VSNNLTNSTI LQTQTVSTAV ANTTQTAATI TTQTTGLSTD AANLQAITAQ LQLLATELGK TTPNLDPFFA TNFGINSGLD RAQSILQLAP PGKITGISPI SVVQKTANGA SFDYEVSFLA YFADGSNGAP DDNFIFTNEG GAWRLKGNNY KSYVRIQPQA YRWIDAVGTT TVKTGLDVEA EDPGVIGIAT ITITGPGLPA AGLTMTSVGV GVTYFNIIQA QQDPTLNTLT NQWNFLPLSD ATITGTFAAT AAPFTYTFTV KDANGATIET RTKKLAVGPL LSATLDATYF PTISGLASQA MSLLTGKSSI SFFFAKPTAY TVQEQRANLS FWNETSNGYY DTEPLLTDTQ ATITGGIPST LQGAWLSMGA RDGSGRRFDA VLIFK
|
| |