Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3131 |
Symbol | |
ID | 8138481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3635941 |
End bp | 3637503 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644870735 |
Product | hypothetical protein |
Protein accession | YP_003022917 |
Protein GI | 253701728 |
COG category | [S] Function unknown |
COG ID | [COG1808] Predicted membrane protein |
TIGRFAM ID | [TIGR00271] uncharacterized hydrophobic domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 170 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCGGA CGCACCTGAA CTACTACAGG CGGAAACTGA CGGTCTTTCT CGCGGAAAAG GCGGACATGG TGCAGCACCG CGAGGTGATC AGGGAGGTCG CCTCCGGGGT CGAGCGGAGC TGGGTCTACT ACCTGATGCT CGTGGTGGCG GGGCAGATAG CGCTTCTGGG GCTGCTCACC AACAGCGTCG CCGTGGTGAT CGGCGCCATG CTGATTTCCC CGCTCATGGG GCCGATCATC TCGTCGAGCC TCGCCTTGAC CATAGGCGAT CTCTCCTTGG CGCGCCGCGC CTTCAAGACC ATCGCCGTGA GCGTGCTGCT CACCGTGGCG GTGAGCGCGC TGATCAGCCT CGTCTCGCCG CTGAAGGAGC CGACCGCCGA GATCCTGGCC CGGGTGAGGC CCAACATCCT GGACCTCTTC GTGGCGGCGC TCTCGGGGGT CGCGGGCGCG GTCGCCCTTT GCACCAAGCG CAACTATGTG GTCACAGCCA CGGGGGTCGC GGTCGCGACA GCGGTCATCC CCCCCTTGAG CGTGGTGGGG TACGCGATGG GGACCTGGCA GCCCAAGCTG GCGCTGGGGG GCTTTCTCCT TTTCTTCACC AACTTCGTCG CCATCGTGCT CGCCTCGGAC CTGGTCTTCT TCACCCTGGG CTTCAGAACC AGCCTGGCGG AAGAGACTTC CTTCTCGCAC CGCACCAGGA TCGCGGTGAT CGGTGCGGTG CTGGCCCTGG TGTCGGTCCC CCTGGTCTAT ACCTTGGGTG CGGACGTGGC GAGGCTGAAG GAGAAGAAGC GGATCGAGCG CATCCTGAAG AGTCACCTGA ACCGCGAGCA GGTCTCCCGC CTCACCGGCT ACCAGCAGAC GCCGCGAGAC AAGGAGCTAT TGGTGCGGGC CTCGGTCAAC ACGGTGGCGC TGATCGATAG GCCCGAGCGG CAAAGCATGG AGCAGGAATT GGCACGGGGG CTGAAGCGCC CGGTGCGCCT GGAACTGGAG CAGGTGATAG TCGCCTCCGG GCGGGAACTG GCGCCCGTCG AAGGGAAGCG GGAGCCTCTC CCGGTGAGCC GGGGACAGCT TTCAGCGGAA GTTGGGGCCA TGGTGGCAAG CGCCGAGCAG GAACTGTCGA GGGCGCTCGA GCCTTTTCCG GTGAGCCGGA CCAAGGTCAC CTTCGCCGCG CCGGGAGAGC CTTTGCTGAT CACGGCGACC CTTAGGCGCG ACTACCCTTT GAGCCGCGAC GAGCTGCAGA TCCTGTCGCG GGAGCTGGCG CGGGTGCTGG AGCTCCCGGT GGAGCTGAAG GTCGAAGCTA AGCCGCTTCT GCCGCAGCTG ACCTTCACCG CCGACGGCGA ACCGGTACCG CAGACCCAGC AGGCTCTGGA AATCGTCAAG AGTCTGCCGG AAGGGCCCGC CTCCTTCCGC TTCCGCCTCT CCGCGCCCCC CGACCGGCGC CGGGAGGCGC TCTCCCTCAA GGGGTACCTG ACCGGGAAGC TCGCCGTACC CGAGTCGGTG CTGTCCGTTT CGACGCAGAC GCAGAAAAAG CACGGCGTTA CCTTGAGCGT CGTGCGGCAG TAA
|
Protein sequence | MIRTHLNYYR RKLTVFLAEK ADMVQHREVI REVASGVERS WVYYLMLVVA GQIALLGLLT NSVAVVIGAM LISPLMGPII SSSLALTIGD LSLARRAFKT IAVSVLLTVA VSALISLVSP LKEPTAEILA RVRPNILDLF VAALSGVAGA VALCTKRNYV VTATGVAVAT AVIPPLSVVG YAMGTWQPKL ALGGFLLFFT NFVAIVLASD LVFFTLGFRT SLAEETSFSH RTRIAVIGAV LALVSVPLVY TLGADVARLK EKKRIERILK SHLNREQVSR LTGYQQTPRD KELLVRASVN TVALIDRPER QSMEQELARG LKRPVRLELE QVIVASGREL APVEGKREPL PVSRGQLSAE VGAMVASAEQ ELSRALEPFP VSRTKVTFAA PGEPLLITAT LRRDYPLSRD ELQILSRELA RVLELPVELK VEAKPLLPQL TFTADGEPVP QTQQALEIVK SLPEGPASFR FRLSAPPDRR REALSLKGYL TGKLAVPESV LSVSTQTQKK HGVTLSVVRQ
|
| |