Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU1364 |
Symbol | |
ID | 2687959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | + |
Start bp | 1493171 |
End bp | 1494151 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637126039 |
Product | HNH endonuclease family protein |
Protein accession | NP_952417 |
Protein GI | 39996466 |
COG category | [V] Defense mechanisms |
COG ID | [COG3440] Predicted restriction endonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.163097 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTTC TCCCCACCTC CCTCAAATTC TTCTCCTCCC TCTCCCGCGC CTCCGGCGCA GTCTGGACCG AGGCCACCAA GCGCAAGGCG CCGCACAAGC CGCTGCTGCT CCTGGCGGTG CTGGATCTGG TTCATCGCGG TGTCATCACT ACGCCGTTCA TCGCCGTCAG TGGCGATCTG GTGGAGCTGA ACGAGCTGTT CAACCTCTAC TGGCGGCGGA TCATCCCTCT CGGTCAGACC AGCAGCATCG CCTTCCCCTT CTCCCGACTC GCCCGCGAAC CGTTCTGGGA GCTGGTCCCC CAGCCGGGGA AAAACATCAC CGATGCGGTA ATCAACAACA CCTCCTCCGT CAGCTACCTG CGCAAGTACG CCCTGGGGGC GAAGCTGGAC GACGGGCTGT TCCGGGTCAT GGCGAGCGGG GAGGGGCGGG AGGCACTGCG GGAAGCGCTG CTCCTTTCCT GCTTCTCGCC CGAGGCGTCG GCGCAGCTGC GGGAGCAGTC GATCATCAAC CGAGAGGCGT TCGACTACAG CCGACTGCTG GAGGAACAGG CCCACCTGCC GCTGGTGAAG GAGATCGTCG AGGCGGACAA CTACCGGCCC ACGGTGCGGG ACCAGGCCTT CCGCAAGGTG GTGACCTCGG CCTACGACCA CCGCTGCGCC CTGTGCGGCA TCCGCATCGT CACCCCCGAC GGCCACACGG TGGTGGAGGC AGCCCACATC GTGCCGTGGA GCAGAAGCCA AAACGACGAC ATCCGCAACG GCATGGCCCT CTGCAGAACC TGTCACTGGG GCTTCGACGA GGGGATGCTC GGCGTCTCTG ACAACTACAC CGTCATCACC TCCCGCTCCA TCGGCATCGA CCCCAACTTC CCCGGCCTGC TCCAGACCCT CTCCGGCCGT GGCATCATCC CGCCGGCCGA CCCGGATAAA TTCCCGGCCC GCGAGTATTT GGCCGAGCAT CGCCGGGCGT GGCGGCTGTA A
|
Protein sequence | MTFLPTSLKF FSSLSRASGA VWTEATKRKA PHKPLLLLAV LDLVHRGVIT TPFIAVSGDL VELNELFNLY WRRIIPLGQT SSIAFPFSRL AREPFWELVP QPGKNITDAV INNTSSVSYL RKYALGAKLD DGLFRVMASG EGREALREAL LLSCFSPEAS AQLREQSIIN REAFDYSRLL EEQAHLPLVK EIVEADNYRP TVRDQAFRKV VTSAYDHRCA LCGIRIVTPD GHTVVEAAHI VPWSRSQNDD IRNGMALCRT CHWGFDEGML GVSDNYTVIT SRSIGIDPNF PGLLQTLSGR GIIPPADPDK FPAREYLAEH RRAWRL
|
| |