Gene Information |
Locus tag | GM21_3366 |
Symbol | |
ID | 8138733 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3898937 |
End bp | 3900103 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870984 |
Product | protein of unknown function DUF147 |
Protein accession | YP_003023149 |
Protein GI | 253701960 |
COG category | [S] Function unknown |
COG ID | [COG1624] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00159] conserved hypothetical protein TIGR00159 |
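The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and do not come from the source database.

```python
# Values copied from the record above.
start_bp = 3898937
end_bp = 3900103

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1167
print(protein_length)  # 388
```

Under these assumptions the computed values agree with the Gene Length (1167 bp) and Protein Length (388 aa) fields.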
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCCCGC AATTCCGGCC CCAGGACATC GCAGACATAC TCATCATGAC CTTTCTGGTC TACCAGCTTT ACAGCTGGTT CAAGAACTCG AAGGCGCTCC AGGTCGTATT GGGACTGCTG TTTTTGGGGG TGATCTATTT CGTCACCAAG AACCTCGGGC TTTTCATGAC CAGCTGGATC CTGCAGGAAC TGGGAACCGT GCTGCTGGTG CTCTTGATCG TGGTGTTCCA GGCTGAGATC CGTCAGGCCC TTTACCGGCT GAGCCTGTTG CGCAACTTCT TCGACCGCGA GGAGAGCGCC TTGCGCATCG ACCTCCTTGA ATTCTCGGCC ACCGTCTTCT CCCTGGCCTC CCAGCGCATC GGGGCGCTGA TCGTTTTTCA GCGCGAGGAA CTGCTGGACG ACCACATCCT GCACGGCGTC CCCCTGGATT CCCTGGTGAG CGGCTCGCTT TTGACCACCA TCTTCATCCC TTCCTCGCCG CTGCACGACG GAGCGGTGCT GATCAAGGAC GGCCGGGTCT CGCTCGCCTC GTGCCATCTG CCGCTGTCGG TGAGCGCCGA CGTGCCGCAG CATCTGGGGA CCAGGCACCG GGCCGCACTC GGGCTGTCCG AGCGCTCCGA CGCCGCCATC GTGGTGGTTT CCGAGGAGCG GGGAGAGGTC TCCCTTTCCC TTGGAGGCGA ACTGCAGCCG ATGGCCTCCG CAGCGCAGCT CCACGAGAAA CTCACCTCCT TGCTGCAGCC CCTCTCCCCC GAACAACAGC GGGTGGGGCT CAAGTCCAGG CTTTTCGCCA ACCTCTGGCC CAAGGTGGCC ATCCTCTGCA TGGTGGTTGT CTGCTGGCTG CTGATCACCT TCCGGCAGGG GGAGATCCTG ACCATAACGG CGCCGGTCAC CTTCCACAGC CTCCCCGACG CGCTCACCTT GACTCGCAGC TATCCGGACC AGGTCGACCT CCAGCTCAAG TCGTTTTCGA ACCTCGTCTC TCCGAAACAG CTCGACATCG TGGTGGACCT CGACCTCTCC AAGGTAAAGG AAGGGAACAA CAATATCCAG ATCAGCAAGG AGCAGATCAA GCTTCCGCCG GGGGTCGTGG TGGTCAATAT AGAGCGCTCC CTGATCCGCG TTACGGCCGA ACGCAAGCCG TCGAGGGAGG AAAAGCGTCG CCGTTAA
|
Protein sequence | MLPQFRPQDI ADILIMTFLV YQLYSWFKNS KALQVVLGLL FLGVIYFVTK NLGLFMTSWI LQELGTVLLV LLIVVFQAEI RQALYRLSLL RNFFDREESA LRIDLLEFSA TVFSLASQRI GALIVFQREE LLDDHILHGV PLDSLVSGSL LTTIFIPSSP LHDGAVLIKD GRVSLASCHL PLSVSADVPQ HLGTRHRAAL GLSERSDAAI VVVSEERGEV SLSLGGELQP MASAAQLHEK LTSLLQPLSP EQQRVGLKSR LFANLWPKVA ILCMVVVCWL LITFRQGEIL TITAPVTFHS LPDALTLTRS YPDQVDLQLK SFSNLVSPKQ LDIVVDLDLS KVKEGNNNIQ ISKEQIKLPP GVVVVNIERS LIRVTAERKP SREEKRRR
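The relationship between the two sequences can be illustrated with a minimal translation sketch using NCBI translation table 11 (bacterial, archaeal and plant plastid), the table listed in the record. The gene sequence as displayed begins with TTG and ends with TAA, which suggests it is already shown in coding orientation even though the gene lies on the minus strand; the sketch relies on that reading. In table 11, TTG is one of the permitted initiation codons and is read as methionine at the first position. The helper name `translate` and the short test fragments are illustrative only.

```python
from itertools import product

# Codon table in NCBI base order (T, C, A, G); the amino acid
# assignments of table 11 match the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(cds: str) -> str:
    """Translate a coding-strand sequence; spaces between blocks are ignored."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = cds[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("TTGCTCCCGC AATTCCGGCC CCAGGACATC"))  # MLPQFRPQDI
```

The printed fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function would be the natural way to compare against all 388 residues.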