Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4118 |
Symbol | |
ID | 8139492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4701828 |
End bp | 4703684 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871733 |
Product | hypothetical protein |
Protein accession | YP_003023891 |
Protein GI | 253702702 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 71 |
Fosmid unclonability p-value | 0.41011 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGGA TGACTCCAGC AGATAACCAT AAGAAAGCGT CGTTGTTCCT TTTGCTGCTC GCACTCCTCG CGCTGCCTGG ATGCACCCGC GCCGGCAAGG TCAGCCAATG CGTGGTCTGC CACCCGAAGA TCGAAAAGGT GTCGAAAAGC CATGCCGACT GCGTCTCCTG CCACGGCGGC GACGCTTCCA TAAGGAACAA GCACGCCTCC CACCTGGCGA TGTACGGCCC CCGGAACCCG GCGGCGCCGG AGCACTGGGA AAATACCTGC GGCTCCTGCC ACCAGTACCA GTTGGACCGG GTGCGTTCCA ACCTCATGTA CACGACGACG GGGATGATCA AAAACATCCA GCTCACCTGG GAGGGGCCGG AGGGGCTCTA CAGCAGCAGG GGAGGGGACG AGTACGATCC GGCGGGAAAA CCGCGCCGGC TTAAGCCGGT GGCCGAACTC GACCATATCT CCGGCGAGTT GTACCGGAAG TTCTGCTCGC AGTGCCACGT GGCCACGGAA AGCGGCGAGG TCTACGGTGC GAGCCACGCC TCCGGCTGCG CCGCCTGCCA TTTCCCGTAC AACGACCGCG CCACCTACCA GGGGGGGGAC GCTGCGGTGC GGGGGAAGGG GCCATATGCC GCGAGCCACG CCATGGAGAC GCTCCCGGGG ACCGAGGTCT GCGCGCGCTG CCACAACAGA AGCGGACGGA TCGCCCTCTC TTACCAAGGG CTCTACGACG GGAACAACTC GATGGTCCCC ACCAGAAACG GCCGGCCCGG TCCGGTGATG ACCTCGGGGG GGCGCAACCT CACCCATATC GCCTCCGACG TCCATTTTGC CGCCGGCATG GAGTGCATCG ACTGCCACAC CTCAAGGGAC ACCATGGGGG ACGGCTACGG CTACGAGAAC ATGTACATGC AGACCGAGGT CTCCTGCGAG GACTGCCACG GCGGGGCGAG CCCCCCGCGC TACGAGCGGA TAGCCGGCGA GAGCGACGAG GCCATCCGCG AATCGCGCGG CTACGCCATG CAGATGCGCC AAGGGATGAA GATGATCCTC ACCGCCAAGG GGCGCAAGTA CTCCAACGTC TTCTACCGCG ACGGCGCCGT CTGGGTGCTG GGAAAAAGAA GCGGCAAGCT CTTCAAAAGC CGTGTGATCA CCGGGACCCC CGAGCACAGC GTGGCCGGCC ACGGCAGGAT GGAATGCTAC TCCTGCCACT CCCGCACCGT TGTCCAGTGC TACGGCTGCC ACACCACCTA CGACAGGAGC AAGCCGGGGA TGGATTACAT AGCCAAAATG GCGACCCCCG GGCGCTTCAG CGAGAAGGAA GATTACCGGA TGCTCTACCC CTTCCCGCTG GCCCTGAACC AGCGAGGGAA GATCTCGACG GTCACCCCCG GGTGCCAGAC CTTCGTCACC GTGGTCGAGC CCGACCTCTC CGTCTCCAAG GACGAGTACG TCGCCAGGTT CAAGGGGAAA AAGCAGCTGC GCTTCGCCCC CTTTTACTCG CACAACACCG GAAAGAAGGC GATCGGCTGC GGCGAATGCC ACGGCAACCC CGCCTTTCTA GGCTTCGGGC AGCACGTGGT CTCGGGGGGG GATATAGAGG GGACCCTGAT CTGCGAGCAG TCCGCCGACA AGCCCTTGGA CGGCTTCCTC ACCCTGCAGG GGGGTAAGGT GCGCGCCTAT TCCGCCATCA CCCGGGAGAG CTCGCGGCCG CTGAACGGGG CGGAGGTGCG GCGGGCGCTG TCGGTGAACC TCTGCCTGGT CTGCCACGAA AAGGCCAAAG ACCCGATCTA TCGAAAGGAG CTGGATTATC GTGCGCTCAA TGATGCTCTG CATCGTCGCC TGCTTTCTGC TCCTTAG
|
Protein sequence | MDGMTPADNH KKASLFLLLL ALLALPGCTR AGKVSQCVVC HPKIEKVSKS HADCVSCHGG DASIRNKHAS HLAMYGPRNP AAPEHWENTC GSCHQYQLDR VRSNLMYTTT GMIKNIQLTW EGPEGLYSSR GGDEYDPAGK PRRLKPVAEL DHISGELYRK FCSQCHVATE SGEVYGASHA SGCAACHFPY NDRATYQGGD AAVRGKGPYA ASHAMETLPG TEVCARCHNR SGRIALSYQG LYDGNNSMVP TRNGRPGPVM TSGGRNLTHI ASDVHFAAGM ECIDCHTSRD TMGDGYGYEN MYMQTEVSCE DCHGGASPPR YERIAGESDE AIRESRGYAM QMRQGMKMIL TAKGRKYSNV FYRDGAVWVL GKRSGKLFKS RVITGTPEHS VAGHGRMECY SCHSRTVVQC YGCHTTYDRS KPGMDYIAKM ATPGRFSEKE DYRMLYPFPL ALNQRGKIST VTPGCQTFVT VVEPDLSVSK DEYVARFKGK KQLRFAPFYS HNTGKKAIGC GECHGNPAFL GFGQHVVSGG DIEGTLICEQ SADKPLDGFL TLQGGKVRAY SAITRESSRP LNGAEVRRAL SVNLCLVCHE KAKDPIYRKE LDYRALNDAL HRRLLSAP
|
| |