Gene Information |
Locus tag | Gura_0757 |
Symbol | |
ID | 5164363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | + |
Start bp | 896200 |
End bp | 897360 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640548256 |
Product | hypothetical protein |
Protein accession | YP_001229539 |
Protein GI | 148262833 |
COG category | |
COG ID | |
TIGRFAM ID | |
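The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only an illustration: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


if __name__ == "__main__":
    length = gene_length(896200, 897360)
    print(length)                  # 1161
    print(protein_length(length))  # 386
```

Under those assumptions the computed values match the Gene Length (1161 bp) and Protein Length (386 aa) rows.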
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00827619 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGATAT TTGCAACCAT AGCATCGTCG CTGCTGTTAA TGCTTGTCAT CACCGCCTGT GGCGGCGGTT CCAGTCCCAC TCCCGTCGCC GCCAACACCA CCGTCTCCGG CATCGCCTCC AAGGGCCCGA TCGTCAACGG CGTAGTCAAG ATATATTCGG TCATAGACGG CGCCAAAAGC ACACTGCTCG CCCAGACCAC CACCGACGCC AACGGCAACT ACACCGCCAA TCTCGGCAGC TACGTGGGTC CGATCATAGT CGAGGCGAGC GGTTCCTACC TGGACGAAGC CACCGGCACC ACCAAGACCA TTCCAGCGGA TTCTCCCATC CACGCCGCGC TCCCCTTGGC GCAGGGCGCG GTCAACCTGC CGGTAACGGC CCTAACCGAA CTTGCCTACA TCAAGACCGG GGGCGACCTG ACCGCATCGG CAATCAGCAC AGCCAACACC CTGGTCTCCG ATCTCTTCAA GGTAGACATC ATCGCCACCT CGCCGGTTGC GCCTACGACT GAAGCTCTTA AAACCGCCAC CCAGGCCGAG AAGGACTACA CCCTGGCTTT GGCGGCCATT TCCCAGATGG CCAGCACGAC AACCGGAGCC AGCGACACCG ACAAACTTAA CAATGCGCTG TCGACCATGG GACAGGGCAT ATCATCCACC GGGATGACCT CCGATACCGC CGCCACGGTC CAGGCTGCCC TGACCACCTT TGTCACTACC AACGCCAACA ACAATACCGG CGTCAGCGAC ACCTCCACCA CCAGCCTGGT AAACATCGGC ACCCTGTCCA AGAGCTATAA GCTGGTGCTC CAGGGAACAT CCACCCCCGG CAGCGTTACC GGCCTTCAGT TCAATATCTC TCTCCCGGCC GGCGTCACCG TCAATGTCAA CAGCTTAACC TCGGCCGTCC TGGCCAGCAG CCTCGCCCTT TCTAGCAATC CATCTTCAAG CTCATTGCTG GCGGCAAAAT ACTCACCCGG CAGCCTAAAC ATTGGCATTG TTAATACAAG CGGGTTCAGC GTTGGCGACG TGGCCACCTT GACCTGCAAC ATCCCCGCTG GAGTGAGCGT ACCCGAGCCC TCAGCCTTCA CCGTCACCAA CCTCAAGAGC ATCGACAAAT TAGGCGCTAC CGTGACAGGA GCCACGATTA CAGTTAATTA A
Protein sequence | MKIFATIASS LLLMLVITAC GGGSSPTPVA ANTTVSGIAS KGPIVNGVVK IYSVIDGAKS TLLAQTTTDA NGNYTANLGS YVGPIIVEAS GSYLDEATGT TKTIPADSPI HAALPLAQGA VNLPVTALTE LAYIKTGGDL TASAISTANT LVSDLFKVDI IATSPVAPTT EALKTATQAE KDYTLALAAI SQMASTTTGA SDTDKLNNAL STMGQGISST GMTSDTAATV QAALTTFVTT NANNNTGVSD TSTTSLVNIG TLSKSYKLVL QGTSTPGSVT GLQFNISLPA GVTVNVNSLT SAVLASSLAL SSNPSSSSLL AAKYSPGSLN IGIVNTSGFS VGDVATLTCN IPAGVSVPEP SAFTVTNLKS IDKLGATVTG ATITVN
| |
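The relationship between the gene sequence, the protein sequence, and the GC content can be sketched in a few lines of plain Python. This is an illustration only, not the pipeline that produced the record. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The sketch relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), and it assumes the sequence is given in frame on the coding strand.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the third base varies fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    prefix = "ATGAAGATAT TTGCAACCAT AGCATCGTCG"
    print(translate(prefix))  # MKIFATIASS
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the Protein sequence and GC content rows.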