Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1508 |
Symbol | |
ID | 8136837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1763026 |
End bp | 1764096 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644869120 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003021322 |
Protein GI | 253700133 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 74 |
Fosmid unclonability p-value | 0.577537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCTTT CGGTAGAAAA AAAGGTAGGC ATGTTCTTCC TGTTCGGTCT CATCATCCTG GGGGTGCTCC TTGAGGTGGG TGAGAAGTGG AACCCGTTCG AGAAGAACGT CCCCTATAAG ACCTATCTCA CCAGCATCAC CGGGTTGAAG GTAGGCGACC CGGTGCGGCT TGCCGGCGTA GACGTGGGGC GCATCAAGGG AATCGACATC CTGAACGACA AGATCGAGAT CAACTTCGAG GTGAAGCCCG GTACCAGGAT CAAGACGGAC ACGGTTGCGG GACTGCGGCT CACGAACCTT CTGGGCGGCC AGTTCCTGGG GCTTTCCTTC GGATCCCCCA ACGCGCCGCT TCTGGAGCCG GGCGGGGTGG TGAAAGGGAA GGACGTCGCC AACATCGACA TCATCGTCGA CAACCTGAGC GACGTGGTGA AGGACGCCAA GGTCTTCGTG AACAACCTGG ACCGCAACCA GGACATAGTA CTTAAGAAAA TCTCGGTGAT GCTGGACGAG AACCGCGGCA ACCTGAGGGC GTCCATCTCC AACATCAACA GCATTACCGG CAAGATGGAC CGCGGCGAAG GGTCGCTGGC GCTGCTTTTG AACGAAAGAA AGCTCTTCGA CGACGCAAGC GGCGCGGTGG AGAGCCTGAA GGTGGTGGCT GGCAAGATCG AGCGCGGCGA GGGGACTCTC GGGAAGCTGG TCAACGACGA ATCTGTCTAT CGCGAAGCCT CCGCCCTCAT CACCGACCTC AGGGCGGGCG CGAAGGACCT GAACGCCGGG ATGAAGGACG TGAAGGAGAT CACGGCCAAG GTCAACCGCG GCGAGGGGAC CCTGGGCAAG CTGATCAACG ACGAGACGCT CTATGTCGAC CTGCGCGAGG CGTCGAAAAA CGTGAAGGAA ATCACTCAGA AGATCAACTC CGGTCAGGGG ACCCTGGGAA AACTGGTCAA CGAGGACCAG CTCTACCGCG ACACCACCGC CACGCTCAAG AAGACCGAGC GGGCCATGGA AGGTCTCGGA GACGCAGGTC CCATCTCCGT CATCGGCTCC ATCGTGGGGA CCCTCTTCTA G
|
Protein sequence | MALSVEKKVG MFFLFGLIIL GVLLEVGEKW NPFEKNVPYK TYLTSITGLK VGDPVRLAGV DVGRIKGIDI LNDKIEINFE VKPGTRIKTD TVAGLRLTNL LGGQFLGLSF GSPNAPLLEP GGVVKGKDVA NIDIIVDNLS DVVKDAKVFV NNLDRNQDIV LKKISVMLDE NRGNLRASIS NINSITGKMD RGEGSLALLL NERKLFDDAS GAVESLKVVA GKIERGEGTL GKLVNDESVY REASALITDL RAGAKDLNAG MKDVKEITAK VNRGEGTLGK LINDETLYVD LREASKNVKE ITQKINSGQG TLGKLVNEDQ LYRDTTATLK KTERAMEGLG DAGPISVIGS IVGTLF
|
| |