Gene Information |
Locus tag | GM21_1979 |
Symbol | |
ID | 8137313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 2293073 |
End bp | 2294083 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644869592 |
Product | Rhomboid family protein |
Protein accession | YP_003021789 |
Protein GI | 253700600 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 105 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTCCG CTTTCCAGGT GACTCAGGGG GAAGGATCTG ACATACTTGC CGCCATGAAA AACGAGCAGG AAGAAAACCT GGAGAGTGCG GAGGAGTGGG TGGCGGTGCC GCCGGCCAAG GTGGAAGCCC AGGCGGGGAC GCGCCTGGCG CAGCGGCGCG CGCGACTTTG GGCTCTGGTG CTGGAAGCGC GCTACATCGA AAACCGGGTC GAGCCCGGAG GGGGAGGATG GCAAGTACTG GTCCCCCCCT CACGGTTGGA GGACGCCTGC CGCGAACTGC GCCTTTACGC CGAGGAGAAC CACAACTGGC CGCCTTTCCC GCCGCCGGTC CGCCCGATGG CCAAGAACAC GCTCCCCACC CTCTGCGTAC TGCTGCTGCT CGCCACCTTC CACAACCTGA CCAACCTCGA CCTGACCGTG ATGGGGCGCC ACCCGGTGAA CTGGGCCGAG ATCGGCAGCG CGCACGCCGG CGCCATCCTG CGGGGCGAGT GGTGGCGCGT CGTCACCGCG CTCACCCTGC ACGCGGACGC GCTGCACCTC ATGAGCAACC TCGCCATCGG CGGGTTCTTC ATCGTCTACC TCTGCCGGGA CCTAGGCTCC GGGCTCGCCT GGAACCTGCT TTTGGCGTCA GGTGCCTGCG GCAACCTCGC CAACGCCTAC ATTCAGCTCC CAAGCCACAA TTCGGTCGGC TCCTCCACAG CGGTCTTCGG AGCTGTAGGC ATTCTGGGCG CGATTTCCAT GATGCGCTAC CGGCACCACC TGCGCAGACG CTGGCCCCTG CCGGTCGCTG CGGCGCTCGC GCTGCTGGTG CTCCTCGGCA CCGAAGGGGA ACGCACCGAC CTGGGTGCGC ACCTCTTCGG CTTCTGCTTC GGCTCGCTAT TCGGTGTGGT GGCGGAACTC CTGGTGGGAT ACCTGGGGCA GCCGAAGCGG CTGGTCAACG CGCTCCTCGC GCTGGCCAGC GCCTCGGTAG TCGTCGCCGC CTGGATGTCG GCGCTTAACT TTCAGGGGTA G
Protein sequence | MQSAFQVTQG EGSDILAAMK NEQEENLESA EEWVAVPPAK VEAQAGTRLA QRRARLWALV LEARYIENRV EPGGGGWQVL VPPSRLEDAC RELRLYAEEN HNWPPFPPPV RPMAKNTLPT LCVLLLLATF HNLTNLDLTV MGRHPVNWAE IGSAHAGAIL RGEWWRVVTA LTLHADALHL MSNLAIGGFF IVYLCRDLGS GLAWNLLLAS GACGNLANAY IQLPSHNSVG SSTAVFGAVG ILGAISMMRY RHHLRRRWPL PVAAALALLV LLGTEGERTD LGAHLFGFCF GSLFGVVAEL LVGYLGQPKR LVNALLALAS ASVVVAAWMS ALNFQG
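
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based, inclusive coordinates, that the listed gene sequence is in coding orientation and includes the stop codon, and that the stop codons are those of translation table 11 (TAA, TAG, TGA). All function names are hypothetical helpers introduced for this example.

```python
# Values copied from the record above.
START_BP = 2293073
END_BP = 2294083

# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


def join_blocks(blocked: str) -> str:
    """Remove the whitespace separating the 10-character display blocks."""
    return "".join(blocked.split())


def ends_with_stop(cds: str) -> bool:
    """True if the last three bases of the sequence form a stop codon."""
    return cds[-3:].upper() in STOP_CODONS


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1011 336
```

Under these assumptions the computed values agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields. Passing the last two display blocks of the gene sequence through `join_blocks` and `ends_with_stop` confirms that the listed sequence ends in the stop codon TAG.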