Gene Information |
Locus tag | Gura_2004 |
Symbol | |
ID | 5166982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 2321079 |
End bp | 2322305 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640549498 |
Product | hypothetical protein |
Protein accession | YP_001230767 |
Protein GI | 148264061 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
|
|
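The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in Protein Length; neither convention is stated in the record, although both agree with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 2321079
END_BP = 2322305
GENE_LENGTH_BP = 1227
PROTEIN_LENGTH_AA = 408


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1227
print(protein_length(GENE_LENGTH_BP))  # 408
```

Under these assumptions, both computed values match the Gene Length and Protein Length fields of the record.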
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAGAC TCAAGCTATC TACGTTTCTC GTTGCCGCTG TTCTTGCAGC CGGAGTCCCC GCCGCTCAGG CGACGCCATT CCATGCCGGC GGCACCGGCA ACTGCGATGG ATGTCACGGT CCGAACTTTA CCGCAGGAAG CTACTCAACA TTGCGCGGCG TTGATCCTGG CTCGACCTGC CTTCGTTGCC ATGGCGCAGC CAGGCCGACT GAGCATCAAA TAGCCACTCA TCCCGTTCCG CCCAAGGGAA TTCCTCCCGT GTCCCTTACC CCCGGCGGCG ATTTTGCTTA TCTGCGGAAA AACTATTTCT GGGGCGATTC CAACGGCAAG CGTGGCATAA GTCCCGGAGA AAGACACGGA CACAACATCG TTGCAGCCGC TTACGGCTAT TCCCGGGACA CGGCCCTCCT CGGTTCCCCG GGCGGCTCAT ATCCGGCTGA TGCCCTATCC TGCATCAGCT GCCATGATCC CCACGGGAAC TACCGGGTGC TGGACAGATA CGGCACTGTT TCGAGCGAGG GGAACCCCAT CGGCGAGGCG GGTTCTTACG GGGCAGCAGC AACAAGCGCC AGTTCCGTCG GCACTTACCG CTTGCTGGCC GGCAAGGGAT ATCAAACAAA ATCTGCCGGG CACGTTTTTT CCTACGATCC GCCGATGGCC GTTTCCCCCA CCAGCTACAA CAGATCCGAG GCAAGTTCTG ATACGAGGGT TGCCTACGGC AAAGGGGTGT CGAAATGGTG CGCAAACTGC CACGAAGGTT TCCTGTCTGG GACGAGCCAT ATCCACCCGG CTGACGTGGA GCTTGGTGTG GCAATAGCCG CGACCTACAA CAATTACGTG AAGTCCGGCG ATCTTACCGG CATCCGGGCC ACCGCCTATA CCTCGCTGGT CCCGTTCCAG AGTGGCGAAG TGACCGACCC GCAGCAACTG TCTGCCGAGT TGACTTCTAC CGCCGGTCCC AACCCGGACG ACAGGATAAC CTGCCTTACC TGCCATCGGG CCCATGCCTC CGGCTGGGAC AGCATAGGCA GGTGGAACAT GAAGGGGGAC TTCCTGACCG TGGCCGGCGC GTATCCGGGA ATCGACACCA ATGGTACGGG GAATTACGGC GAGAACTCCA CCGGCAAACT CAGAACCGAG TATCAGGCGG CCATGTACGG CCGGGACGCA TCCGGATTCG CAACTTTTCA ACGGCAGCTT TGCGATAAGT GCCATGCGAA GGATTGA
|
Protein sequence | MNRLKLSTFL VAAVLAAGVP AAQATPFHAG GTGNCDGCHG PNFTAGSYST LRGVDPGSTC LRCHGAARPT EHQIATHPVP PKGIPPVSLT PGGDFAYLRK NYFWGDSNGK RGISPGERHG HNIVAAAYGY SRDTALLGSP GGSYPADALS CISCHDPHGN YRVLDRYGTV SSEGNPIGEA GSYGAAATSA SSVGTYRLLA GKGYQTKSAG HVFSYDPPMA VSPTSYNRSE ASSDTRVAYG KGVSKWCANC HEGFLSGTSH IHPADVELGV AIAATYNNYV KSGDLTGIRA TAYTSLVPFQ SGEVTDPQQL SAELTSTAGP NPDDRITCLT CHRAHASGWD SIGRWNMKGD FLTVAGAYPG IDTNGTGNYG ENSTGKLRTE YQAAMYGRDA SGFATFQRQL CDKCHAKD
|
| |
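The TIGRFAM annotation (doubled CXXCH domain) and the two sequences above lend themselves to a quick sanity check. The sketch below is only an illustration: it scans for the literal pattern C-x-x-C-H with a regular expression, which is a much cruder test than the TIGRFAM model itself, and it translates just the first ten codons using a partial codon map. To keep it short it works on prefixes copied from the record (the first 70 residues and the first 30 bases); the helper names are hypothetical.

```python
import re

# First 70 residues of the protein sequence listed above (spaces removed).
PROTEIN_PREFIX = (
    "MNRLKLSTFLVAAVLAAGVPAAQATPFHAGGTGNCDGCHG"
    "PNFTAGSYSTLRGVDPGSTCLRCHGAARPT"
)

# First 30 bases of the gene sequence listed above (spaces removed).
GENE_PREFIX = "ATGAACAGACTCAAGCTATCTACGTTTCTC"

# Only the codons that occur in GENE_PREFIX; this is NOT a full genetic code.
# These assignments are the same in the standard code and in table 11.
CODONS = {
    "ATG": "M", "AAC": "N", "AGA": "R", "CTC": "L", "AAG": "K",
    "CTA": "L", "TCT": "S", "ACG": "T", "TTT": "F",
}


def find_cxxch(protein: str) -> list[tuple[int, str]]:
    """Return (1-based position, motif) for every C-x-x-C-H match.

    A lookahead is used so that overlapping matches would also be reported.
    """
    return [(m.start() + 1, m.group(1))
            for m in re.finditer(r"(?=(C..CH))", protein)]


def translate_prefix(dna: str) -> str:
    """Translate complete codons of `dna` using the partial CODONS map."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


print(find_cxxch(PROTEIN_PREFIX))     # [(35, 'CDGCH'), (60, 'CLRCH')]
print(translate_prefix(GENE_PREFIX))  # MNRLKLSTFL
```

To scan the whole protein, paste the full sequence with its spaces removed in place of `PROTEIN_PREFIX`; translating the whole gene would require a complete codon table.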