Gene Information |
Locus tag | GM21_0725 |
Symbol | |
ID | 8136040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 869713 |
End bp | 871719 |
Gene Length | 2007 bp |
Protein Length | 668 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644868342 |
Product | Nitrate reductase |
Protein accession | YP_003020557 |
Protein GI | 253699368 |
COG category | [C] Energy production and conversion |
COG ID | [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing |
TIGRFAM ID | |
|
|
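Note: the coordinate, gene-length and protein-length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon (the listed gene sequence ends in TAA). The function names are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon, which encodes no residue


# Values taken from the record above.
length = gene_length(869713, 871719)
protein = expected_protein_length(length)
print(length, protein)  # 2007 668
```

Both results agree with the Gene Length (2007 bp) and Protein Length (668 aa) fields.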
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0000000000000713238 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCGTCA TTACCAAGAG ATCGGTCTGC CCCTTCGACT GCCCGGACAC CTGCGGCATG CTGGTGGAGG TGAAGGGAGG AAAGGCGGTA GGCGTGAAGG GAGACCCGCT GCATCCCTTC AGCCGCGGCA CCCTCTGCCC GAAGATGCGC CACTACGAGA AAAGCGTGCA CTCGCCCTTG AGGCTCACTA CTCCGCTCAA GAGAACCGGC GCCAAGGGAA GCGGCGACTT CGCCCCCATC TCCTGGGAGG AGGCGGTCGC GACCATCGCC GAGCGCTGGC GCGGCATCAT CGCTACGCAC GGGGCCGAGG CGATCCTCCC CTACTCCTAC GCGGGGACCA TGGGGATTGT GCAGAGAAAC GCCGGGCACC CCTTCTTTCA CCGGCTGGGC GCCTCCCGGC TGGACCGCAC CATCTGCTCC CCGGCCAAGG GAGCGGGTTG GCAGGCGGTG ATGGGGCAGA CGGCGACACC CGCCCCGGAG ACGGCCGGCA AAAGCGACCT CCTGATACTG TGGGGGATCA ACGCGGCCGC CACCAGCATC CACTTCCTGA ACCAGGCGAA GGAGGTGCGG GCAAACGGCG GGGCGGTGTG GCTGATCGAC ACCTACGAGA CCCCGACCGC GTCGGCGGCG GACCGCACCT TCCTGGTCCG CCCCGGAAGC GACGGGGCCC TGGCTTTAGG GATCATGCAC GTCCTCGCCC GGGAGGGGCT CACCGACACG GCGTTTCTCG ACTCGCAGGT GCTGGGTCAC GAGGAGTTAA AGCGCGAGGT GCTCCCGCTC TACACGCCCG AGGTCTGCTC CCGCATCACC GGGCTCGCCG TGGCACAGAT CGAACTCATG GCGCGCTTCT TTGCCGCGGC CCGCGCCCCC TTCATACGCC TGGGGAGCGC GCTGTCGCGC TACGGCAACG GCGCCATGAC CGTGCGGACC ATCTGCTGCC TTCCGGCGCT GGTTGGGGCC TACGGCAAGG AAGGAGCAGG ATGCTTCCCC GACACCGCCA CCGGCGCCAC CTTCCCGATG GGGCTTTTGC TTCGGGAGGA CCTGATCCAG GGCTCGCCGC GCCTTGTCAA CATGAACCAG CTGGGACAGG CGCTGAACGA GCTCTCGGAC CCGCCGGTCA TGGGGCTCTA CGTATACCAC TCCAACCCCG CGGCGGTCAC CCCGGACCAA AACGCGGTCC TGAAGGGGCT CGCGCGGGAG GACCTTTTCA CCGTGGTGCA CGAGCGCTTC ATGACCGACA CGGCGCGCTA CGCCGACATC GTGCTCCCGG CCACCTCTTC GCTTGAGCAC AGCGACATCT ACCGCTCCTA CGGCACCTAC TGCATCCAGA GGGCGCAGGC GGCGATCGAT CCCGTCGGGG AGAGCAGGTC CAACTGGGAG GTCTTCTCGC TCCTCGCGGC GGAGCTCGGT TTTGAGGAGG AGCTCTTCAC CCTCTCCGCG GACCAGGTGA TCGACCGGCT CCTAGCCGTC CCCACGCCGC TGCGCCAGGG GATCGACGAG GAGGCGCTCG CCGCGGGACT CCCGGTGCAA TTGGCCCCCC GGCAAGCGGG ATACCGGACC CCCTCGGGGA AGATCGAGAT CCTGAACCGG CAGCTGGCGC ACCCGCTCCC CGTTTACCTC CCCACCCACG AGGAGAACGG CCCCCTCCCC TTTAGGCTCA TGACCGCCCC AAACCCCTAC GCCCTGAACG CCACCTTTTA CGAGCAGGAG GAACTAAGGG CGCGGCAGGG GGGGATGCAG CTTCAGATGA ACCCCGCGGA CGCCGCTGCC AAGGGGCTTG CCGACGGCGA GCGGGTGGTG 
GCCTGGAACC GGCTGGGAGA GGTGACCTTC CTACTCAAGA CGACGGAGAA GGTCCCCCAA GGCCTCGTGG TCGCGGAGGG GGTCTGGTGG CTCGCCTACG CGCCGGGGAG CCGCTCTGTG AACGCCCTCA CCTCGCAGCG CCTGACCGAC GAGGGTGGAG GAAGCACCTT CTACGACAAC CGCGTCGACG TGCGCCGGGA GCTCTAA
|
Protein sequence | MSVITKRSVC PFDCPDTCGM LVEVKGGKAV GVKGDPLHPF SRGTLCPKMR HYEKSVHSPL RLTTPLKRTG AKGSGDFAPI SWEEAVATIA ERWRGIIATH GAEAILPYSY AGTMGIVQRN AGHPFFHRLG ASRLDRTICS PAKGAGWQAV MGQTATPAPE TAGKSDLLIL WGINAAATSI HFLNQAKEVR ANGGAVWLID TYETPTASAA DRTFLVRPGS DGALALGIMH VLAREGLTDT AFLDSQVLGH EELKREVLPL YTPEVCSRIT GLAVAQIELM ARFFAAARAP FIRLGSALSR YGNGAMTVRT ICCLPALVGA YGKEGAGCFP DTATGATFPM GLLLREDLIQ GSPRLVNMNQ LGQALNELSD PPVMGLYVYH SNPAAVTPDQ NAVLKGLARE DLFTVVHERF MTDTARYADI VLPATSSLEH SDIYRSYGTY CIQRAQAAID PVGESRSNWE VFSLLAAELG FEEELFTLSA DQVIDRLLAV PTPLRQGIDE EALAAGLPVQ LAPRQAGYRT PSGKIEILNR QLAHPLPVYL PTHEENGPLP FRLMTAPNPY ALNATFYEQE ELRARQGGMQ LQMNPADAAA KGLADGERVV AWNRLGEVTF LLKTTEKVPQ GLVVAEGVWW LAYAPGSRSV NALTSQRLTD EGGGSTFYDN RVDVRREL
|
| |
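Note: the gene sequence above is printed in space-separated blocks of 10 bases, so it has to be stripped of whitespace before any computation. The following is a minimal sketch of how a GC percentage such as the GC content field could be computed from such a string. The helper names are hypothetical, and the demonstration uses only the first two blocks of the gene, so its result is not the whole-gene value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small demonstration.
first_blocks = "ATGAGCGTCA TTACCAAGAG"
print(clean_sequence(first_blocks))  # ATGAGCGTCATTACCAAGAG
print(gc_percent(first_blocks))      # 45.0
```

Passing the complete Gene sequence field (both lines joined) to `gc_percent` gives the value for the whole gene.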