Gene Information |
Locus tag | GM21_3112 |
Symbol | |
ID | 8138462 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3608911 |
End bp | 3610221 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644870716 |
Product | NHL repeat containing protein |
Protein accession | YP_003022898 |
Protein GI | 253701709 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 155 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCTA ATCGTGGTGA TCGCAAGGTC CGTAGTGTGG TTCGAATGCT GGCGCTGGTG CTTTTGTCCG CCTGCATGAC CGCCTGCGCC GGCAAAACGG CAAAGAACAG CGTCTTTTGG CCCGGCGCGC CCGATCTGCC CCGCATCCAG TTCCTGACCG CATTCAAGGA TTCGAAGGAT GTCATCGGCG AGAAGAAGCT TTCGCTTCTC GACGTCGGGG GAACGCCGGA CATCTTCATC AACCTGGTGA AGCCGTACGG GATCACGTCC GCCAACGGGA AGCTGTACAT CTGCGACACG CTGCAGGCGG ACGTCATCAC CGTCGACCTC CCGAACAAGA AGATGACCCG CCTCTCCGGG AACGTGAACG CCGGCAGGCT GAAAAAGCCG GTCAACGTGG CCGTCGACGC CAGGGGGAAC ATCTACGTCG CCGACACCTC GCGGCTCGAA GTGCTGCAGT ACGCCCCGGA CGGCTCCTAC GTAAGGAGCA TCGGCACCAG CAAGGACCTC AAACCTGTCG ACGTGCGCGT CGACGACCTC TACCTCTACA TCCTGGACGG GATGACCAGC CAGGTCCACC TCTACGACAT CGCCGGCGGC GACTACGTCA AGTCGATCGG CAGAAACGAC GACCCCAAGA GAAACCTGGC CGGACCGACC AACATGGCGC TCGACAGCAA GGGGGGGGTC TACGTCAGCA ATTTCGGCAG CGGCCGGATC ATCAAGCTGG ATAGGGACGG GAACTTCCAC CTGGGATTCG GCAAGCTCGG CAGCTCCTTC GCCGATTTCA CCCGCCCGCG CGGGATAACC GTGGACGAAA CCGGCCTGGT GTACGTGGTC GACGCCGGCG CGCAGCACGT GCAGATCTTC GACGACAGGT TCCGGCTGCT GCTGCTCTTC GCCGGCCCGG GGACTCCCGG TTCGCTCAAC ATCCCCGCGG GGATCACGGT CTCCACCGAC AACCTGGACT ACTACCAGAC GCTCGCCGAT CCCGACTTCA AGCTGGAGAA GGTCATCTTC GTGGTGAGCC AGGTGGGAGA GCACAAGGTG AGCGTCTACG GCCTCGGGAA AAAAGAGGGG ATCGATTACG CCGCCGAGGA AAAGAAGACC ATGGAGGACG TGAAGAAAAG GGCGGCCGAG GCGGCGGAGC GGCGGCGCAA GCTGGAGGAG GAGAAGGCGG CGAAGGAGCT CGAAGGTGGC GAGACGAAAG CCGCGGCTCC CGCGCCGGAG CCGGCGGCAC AGGCTGCGGA CGAGCCGGTC AACATCCCCT GGGCCGGGAA AGCGGCCGGC GCCACCCCCC CTGCGCGCTA G
|
Protein sequence | MQANRGDRKV RSVVRMLALV LLSACMTACA GKTAKNSVFW PGAPDLPRIQ FLTAFKDSKD VIGEKKLSLL DVGGTPDIFI NLVKPYGITS ANGKLYICDT LQADVITVDL PNKKMTRLSG NVNAGRLKKP VNVAVDARGN IYVADTSRLE VLQYAPDGSY VRSIGTSKDL KPVDVRVDDL YLYILDGMTS QVHLYDIAGG DYVKSIGRND DPKRNLAGPT NMALDSKGGV YVSNFGSGRI IKLDRDGNFH LGFGKLGSSF ADFTRPRGIT VDETGLVYVV DAGAQHVQIF DDRFRLLLLF AGPGTPGSLN IPAGITVSTD NLDYYQTLAD PDFKLEKVIF VVSQVGEHKV SVYGLGKKEG IDYAAEEKKT MEDVKKRAAE AAERRRKLEE EKAAKELEGG ETKAAAPAPE PAAQAADEPV NIPWAGKAAG ATPPAR
|
| |
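The length fields in this record can be cross-checked against its coordinates. The sketch below is one way to do that, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon; both assumptions are consistent with the values listed above (the gene sequence ends in TAG). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    length = gene_length_bp(3608911, 3610221)   # Start bp, End bp from the record
    print(length)                               # 1311
    print(expected_protein_length_aa(length))   # 436
```

With the coordinates from this record, the two checks give 1311 bp and 436 aa, matching the Gene Length and Protein Length fields. `gc_percent` can be applied to the Gene sequence field as pasted, since it discards the spaces between the 10-base blocks.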