Gene Information |
Locus tag | Gura_4041 |
Symbol | |
ID | 5165928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter uraniireducens Rf4 |
Kingdom | Bacteria |
Replicon accession | NC_009483 |
Strand | - |
Start bp | 4698342 |
End bp | 4700081 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640551520 |
Product | hypothetical protein |
Protein accession | YP_001232758 |
Protein GI | 148266052 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTAAAG GGAACAATGT GCCTACGATA GAAGCGGATG ATTTTGCTCG ACGTTTTTCT TTACGGGCTG GGAACCTGAT GTGGCTACTC GGTGCCGGTG CATCGGCCTC AGCTGGGATA CCCACTGCCG GAGATATGGT CTGGGAGTTT AAGCAGCAAC TATTCATCAG CCAACGACGA GTTTCGCACC AGTCTATGGC CGATCTGTCG AATCCCACAA TTCGTGCTCA GTTACAGGCC TACGTTGATT CCTCAGGGAG TTTGCCTTCT CCCGGCTCCC CGGACGAGTA CGCAGCACTT TTTGAAGCAG TTTATCCTGC AGAGTCGGAT CGTCGCGCCT ATTTGGATGC CAAAATGGGT GGAGCAAAGT TGTCCTATGG ACACTTGGCC CTCGCTTCTC TGATGCATGC ACAACTTACG CGTTTGGTGT GGACAACCAA CTTCGATCCA CTCGTGGCGG ATGCTTGTGC GAAGGTATAT GACGGAACAG GACCACTCAC GACTGTTGCT CTCGAAGCGC CTGATTTAGC TGCTCAGTGC ATAGGAGGAG GAAGATGGCC AATCGAGGTA AAGTTGCATG GAGACTTCAG ATCTCGGCGA CTCAAAAACA CTGGTGATGA ATTGCGTTAC CAAGATCAAC GTTTAAGGCA ACTTCTAGTA GATTCCTGCA AACGCTTTGG ACTTGTTGTA GTCGGGTACA GTGGCCGTGA CGATTCCATT ATGGATGCAC TGGAGGAGGT GCTGGAACAA AATGGCGCTT ACCCTTCCGG ATTGTTCTGG CTGCATCGCG GGGAAGATCC ACCGCTCGCC CGAGTTGAAC AATTGCTGGC GCGAGCGAAA CAGGCTGGAG TGGAGTCAGC ACTGATAAGG GTTCAAAACT TCGATGAGGC AATGCGAGAC TTGGTGCGAA TGGTAAAAAG CATCGACACC ACGATACTCG ACACCTTCGC AGCCGAGCGC CGCCGCTGGA GTAGCGCCCC GCCGCCAGGA GGAAAACGAG GCTGGCCGGT GGTGCGCCTC AATGCAATAC CTGTCGTACA AATTCCAACA GTATGTCGAC GCGTTGTCTG TGAGATCGGC GGCCACGCAG AAGCACGGGA AGCTGTCAAG CAGGCTGGCG TTGACGTCCT CGTTGCTCGC ACTCGGGCTG GCGTGTTGGC CTTTGGAGCC GATGGCGACG TGCGTGCAGC TTTTGGTGGT TACAACATTA CTGATTTTGA CCTACATACC ATCGACAACA AGCGACTGCG CTACGATTCC GGCGAGCGTG GCCTGTTACG CAGTGCACTT ACCCGTGCTT TAGAACGCCA TCATCGATTG GACGCAACTC GTCGGAGAAG TGCTGATTTG TTAGCACCAT CGGACCCAAG AGAGAGTGTC TGGGCACCTC TAAAGCAGCT TGTAGGATCA CTTAACGGTA CGGTCAGCGG TTTCCCCGGT TTGCATTGGC GCGAAGGAAT CGGTACTCGG CTTGACTGGG CTGATGAACG CTTGTGGCTT CTGATAGAAC CCCGCACGGT CTTCGACGGC ATTAACGACG AGAACAAGGC GGCTGCCGCC GATTTCGCCC GTGAGCGAAC TGTCAAGCGT TATAATAAGC AACTCAACGA TCTAATCGTA TTTTGGGCTG ATCTGCTCTC CGGCGGAGGA GACCTGCGTG CGTTAGATAT CGGAGGTGGG GTCGATGCCG TCTTTAGCCT TTCCAATATT ACAGGTTTTT CAAGGAGGGC TGGGGTATGA
Protein sequence | MGKGNNVPTI EADDFARRFS LRAGNLMWLL GAGASASAGI PTAGDMVWEF KQQLFISQRR VSHQSMADLS NPTIRAQLQA YVDSSGSLPS PGSPDEYAAL FEAVYPAESD RRAYLDAKMG GAKLSYGHLA LASLMHAQLT RLVWTTNFDP LVADACAKVY DGTGPLTTVA LEAPDLAAQC IGGGRWPIEV KLHGDFRSRR LKNTGDELRY QDQRLRQLLV DSCKRFGLVV VGYSGRDDSI MDALEEVLEQ NGAYPSGLFW LHRGEDPPLA RVEQLLARAK QAGVESALIR VQNFDEAMRD LVRMVKSIDT TILDTFAAER RRWSSAPPPG GKRGWPVVRL NAIPVVQIPT VCRRVVCEIG GHAEAREAVK QAGVDVLVAR TRAGVLAFGA DGDVRAAFGG YNITDFDLHT IDNKRLRYDS GERGLLRSAL TRALERHHRL DATRRRSADL LAPSDPRESV WAPLKQLVGS LNGTVSGFPG LHWREGIGTR LDWADERLWL LIEPRTVFDG INDENKAAAA DFARERTVKR YNKQLNDLIV FWADLLSGGG DLRALDIGGG VDAVFSLSNI TGFSRRAGV