Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_2095 |
Symbol | |
ID | 8137431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2432315 |
End bp | 2433667 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644869710 |
Product | hypothetical protein |
Protein accession | YP_003021905 |
Protein GI | 253700716 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.0000000000101265 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGAATT CTGAGCCGCT TTGCCTGATC ATCATCCCCT ACGACAAGAA AAAGGACGCC GCCGGCAAGG TGGTCGATTA CGAGACGGTG TACCGCAGGC TGATCGTGCC CGCCGTGGAG GCGGCCGGGC TCTTGCCGGT GCGCGCCGAC GAGCAGAGGA CGGGATGCCT CAACCTCGGG CAGCCCCTTG AGCGGCTGAG GTATTGCGAC TGCGCCCTGG CCGACCTCTC CACCGGCGAC CCCAGGGTCT CGTACCAGCT GGGGATCCGG CAGGCGGTGC ATCCGGTACA GACGCTCCTT GTTTACGCCG CGGGTTGCGC CCAGCTCCAG CTGGAAGCGG AGGGGCTCCC GGCGGTCCCC TATCGCGTGT CGCCGCACGG CCAGCCGATG CACGAGGCGA AATACCGGGC TCTTTTGACG GAGCGTCTGG TGGAGGCGAT GGAAGGCGCC GTGGAAAACG AGGTTTACCG TGCCCTGCGC GGGGGGAGCG GGGAGCCGGC GGCGAAAAAG GAGAGGATAG CGGGGCGGGA GCTTCACGCC ACGGTCTTCG CCAACCTGCT CAAAAAGGGG CGCGCGGGGG GGCTTGCCGC GCTGCGCGAA TTGCAGTCGG AACTCTTCGC TTCGGGCGAG GTCGACCCGG CGGATCTGGT CGAGCTTTTG CTAAGCTATC GCGCGGTGAA CGGCTGGAGC GAGATCATCG AGCTGGCGCG GCGGATGCCC CCGGCCCTGG CGGACAGCGT CCTGGTGCAG CAGCAGTTGG TGCTCGCCCT GAACTGGGCC GGGGAGGGGG AGCATGCGGA GCAGGCGCTG CGGCAGCTCA TCGCCCGGCG CGGGCCGAGC AGCGACAGCT ACGGCATCCT GGGGCGGATA ATGAAAGACC GGTGGGAAAA GGCGCTTGAG CGGGGCGATG TGGAGCTTGC CAGGGAACTG CTGCAAAAAG CCGCCTCGGC GTATCTGAAA GGGTTCGAGG CGGACTGGCG CAGCACCTAT CCCGGAGTGA ACGCGGTGAC CCTCATGGAG CTGAAGGAGC CGGCGGACCC GCGCCGCCGC GAGATCCTCC CCGTGGTGCA CTACGCAGTG GAACAGCGCA TCCGCTCAGG GGCCGCCGAT TATTGGGACT ACGCCACGCT CCTCGAGCTG GCCGCTTTGT CCTGCGACGA GGCGAAGGGG AGGGATGCCC TTGGGCGCAG CCTCGCCATG GTGCGCGAGG CGTGGGAGCC GGAGACGACG GCGCGCGACC TGCGGCTTGT CCGGGAAGCG CGCCGGCGCC GCGGGGCGGA GTGCCCGGCA TGGGCCGAGT ACGCCGAGTC CGAATTGCTG AAGGCCGCCG AGCGCGGCAC GGCGCGCCCC TAA
|
Protein sequence | MPNSEPLCLI IIPYDKKKDA AGKVVDYETV YRRLIVPAVE AAGLLPVRAD EQRTGCLNLG QPLERLRYCD CALADLSTGD PRVSYQLGIR QAVHPVQTLL VYAAGCAQLQ LEAEGLPAVP YRVSPHGQPM HEAKYRALLT ERLVEAMEGA VENEVYRALR GGSGEPAAKK ERIAGRELHA TVFANLLKKG RAGGLAALRE LQSELFASGE VDPADLVELL LSYRAVNGWS EIIELARRMP PALADSVLVQ QQLVLALNWA GEGEHAEQAL RQLIARRGPS SDSYGILGRI MKDRWEKALE RGDVELAREL LQKAASAYLK GFEADWRSTY PGVNAVTLME LKEPADPRRR EILPVVHYAV EQRIRSGAAD YWDYATLLEL AALSCDEAKG RDALGRSLAM VREAWEPETT ARDLRLVREA RRRRGAECPA WAEYAESELL KAAERGTARP
|
| |