Gene Information |
Locus tag | GM21_1503 |
Symbol | |
ID | 8136832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 1757379 |
End bp | 1758407 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 644869115 |
Product | hypothetical protein |
Protein accession | YP_003021317 |
Protein GI | 253700128 |
COG category | [R] General function prediction only |
COG ID | [COG1611] Predicted Rossmann fold nucleotide-binding protein |
TIGRFAM ID | [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1; [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 |
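The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, excluding the final stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1757379, 1758407)
print(length_bp)                  # 1029
print(protein_length(length_bp))  # 342
```

Both results agree with the Gene Length (1029 bp) and Protein Length (342 aa) fields.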
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 75 |
Fosmid unclonability p-value | 0.641769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACTGC GTTTTAACAG AACAAACGAT GAAATCGACC AGATGATCGA CGCATTGATG GAGAAAGCTG GAGGGGTGCA CCACCCGGAC CTGGCACGCG AGATGATCAT CTCCGCACTG AAGGCGGGGC AGGACACCGA TTATTTAGCC GACCTCAAAC TTCTGAGCAA CACCATGAAG GAGATGCGCT ACACCACGAA GATCTTCGCT CCCTACCGGC ACAAGAAGAA GGTGACCATC TTCGGCTCCG CCCGGACCCG CCCCGAAGAG CCGATGTACA AGAAGTGCAT CGACTTCGCC GCCCTATTGG CGGAGAAGGG ATACATGATC ATCACCGGCG GCGGCGGCGG GATCATGCAG GCCGGAAACG AGGGAGCCGG CAGCGAATCG TCCTTTGCCG CCAACATACG GCTCCCGTTC GAGCAGTCCG CGAACCGGGT CATGCTGAAG AACCCGCGAC TCATTACCTA CAAGTACTTC TTCAACCGCA AGGTGGCCTT CGTGAAAGAA TCCGACGCCA TCGCGGTATT CCCGGGCGGC TTCGGGACGC TCGACGAGGC GATGGAAGTA TTCACCCTGA TCCAGACCGG GAAGACTTCC CCCAAACCGC TGGTTCTGGT TGACGACGAG GAAGGGTACT GGGAGCACTT CTTCAGGTTC ATCAAGGAAA GACTGCTGGT TATGGGGTTC ATCTCCGCAG AGGACTTCTC CATCTTCACC ATCACCAAGA GCTACGAGGA AGCGGTCCAG GTCATCGAGG AGTTCTATAC CAACTACCAT TCTATGCGGT TCGTCAACGG CGAGCTCATC ATCCGTGTAA CGAAAATTCT GGCTCCCGAG CAGATCGAGA TGCTGGAGAA CGAATTCCCC GAATTGAGAT TAAACAACAG CCGGATCGAA TTAATTAGCG CTCGACCGGA GGAAGCGGAC GAGCCGGATC TCCTTGATTT GCCGAGGATA GCCTTCCACT TCCACCACCA GCACTACGGG CTGCTGATGG CCTTCATTAG GCGGCTGAAC ACCTTCTGA
|
Protein sequence | MQLRFNRTND EIDQMIDALM EKAGGVHHPD LAREMIISAL KAGQDTDYLA DLKLLSNTMK EMRYTTKIFA PYRHKKKVTI FGSARTRPEE PMYKKCIDFA ALLAEKGYMI ITGGGGGIMQ AGNEGAGSES SFAANIRLPF EQSANRVMLK NPRLITYKYF FNRKVAFVKE SDAIAVFPGG FGTLDEAMEV FTLIQTGKTS PKPLVLVDDE EGYWEHFFRF IKERLLVMGF ISAEDFSIFT ITKSYEEAVQ VIEEFYTNYH SMRFVNGELI IRVTKILAPE QIEMLENEFP ELRLNNSRIE LISARPEEAD EPDLLDLPRI AFHFHHQHYG LLMAFIRRLN TF
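The relationship between the two sequences and the GC content field can be reproduced with plain Python. The sketch below is illustrative only: it assumes the gene sequence is already given in coding orientation (it starts with ATG and ends with TGA), and it uses the standard codon assignments, which translation table 11 shares for every amino acid codon (table 11 differs mainly in the start codons it permits). The names `translate` and `gc_percent` are hypothetical helpers, not part of any database API.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCAACTGC GTTTTAACAG AACAAACGAT"))  # MQLRFNRTND
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in this record, and `gc_percent` on the same input can be compared with the GC content field.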