Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3271 |
Symbol | |
ID | 8138628 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 3802343 |
End bp | 3803803 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870880 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_003023055 |
Protein GI | 253701866 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 119 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTAC CGGCGGCGCT AAAGCGGATC ACAGATCAGT TGTGGGAACT ACCGGTAAGC TACAAGGAAG GGATGCTGGT CCCCGCCCGA ATCTTTGCCT CAGAGAAATT GGTCCGGGAG ATGGATGCCG GCGTTTTCGA GCAGGTCAGC AACGTCGCCA CGCTCCCCGG CATCCAGAGA TACGCCTACT GCATGCCCGA CGGCCACTGG GGCTACGGCT TTCCCATAGG GGGTGTAGCC GCCATGGATC CGGGTACCGG CGTCATCTCG CCGGGAGGGA TCGGTTTCGA CATAAACTGC GGCATGCGGC TCGTATTGAC GAACCTCACC GCCGACCAGG TCATCCCCAA ACTGCATCAA CTGGTCGATC GTCTCTTCGC CCGGATACCC ACCGGCGTCG GATGTCATGG GTTCGTGAAG CTGAAGCAGG ACGATTTTCG TTCCATAGTG CAGCAGGGTT CGCGCTGGTG CCTGAAAAAC GGCTTCGCTA CCCAGGAAGA TCTGGATATG ACCGAGGAAG GGGGCTGCTT TTCCGGCGCC GACGCCTCAC ACATAAGCGA CAAAGCGGTG GAACGCGGCT ACAACCAGCT CGGCACACTG GGGTCCGGCA ACCACTACTG CGAGATCCAG GTGGTGAAGC CTGAGAACGT CATGGACGCG GAATTGGCCG CAGCCTTCGG GCTTACCATG GTACCCAACC AGGTGGTGAT CATGTTCCAT TGCGGCAGCA GGGGCTTCGG GCACCAGGTG GCGACGGACT ACCTGAAGCT GTTCCTCTCC GTCATGGGGC GCAAGTACGG CATAAAGATC GTCGACCGCG AACTTGCCTG CGCTCCTTTT CACTCGCCCG AAGGTCAGGC CTACTTCAGC GCGATGAAGT GTGCCGTCAA CATGGCCTTT GCCAACAGGC AGGTGATCCT GCACCGGATC AGGGAGGTGT TTTCCGACCT GTTCCACGCC TCGCCCGACG AACTCGGGCT GCGCATGGTG TACGACGTGG CGCACAACAC GGCAAAGCTG GAACGGCACG AGGTAAACGG GACCCGGAAG GAACTCCTGG TGCACCGCAA AGGATCCACC CGCGCCTTCG GCCCTGGCGC TGCAGGGCTA CCCGGATGTT ACGCGAAGAC CGGCCAGCCT GTCATCATAG GCGGGAGCAT GGAGACCGGC TCCTATCTGC TCGCGGGGAT GCAAAGCGGC GCCGACGCCT TCTTCACCAC CGCCCACGGC AGCGGCAGGA CCATGAGCAG ACATGAGGCG AAGAAAAATT TCAGGGGCGA CAAGCTGCAG CGTGAAATGG AGGCGCGGGG GATCTACGTC CGCACCGACT CGTTCGGCGG GTTGGCGGAG GAAGCGGGAC CCGCATATAA GAATATAGAC GAGGTCGTTG AAGCCACCGA ACTGGCCGGC TTGAGCAAGA GGGTGGCGCG CCTGGTTCCG ATCGGCAACA TCAAGGGGTA G
|
Protein sequence | MNVPAALKRI TDQLWELPVS YKEGMLVPAR IFASEKLVRE MDAGVFEQVS NVATLPGIQR YAYCMPDGHW GYGFPIGGVA AMDPGTGVIS PGGIGFDINC GMRLVLTNLT ADQVIPKLHQ LVDRLFARIP TGVGCHGFVK LKQDDFRSIV QQGSRWCLKN GFATQEDLDM TEEGGCFSGA DASHISDKAV ERGYNQLGTL GSGNHYCEIQ VVKPENVMDA ELAAAFGLTM VPNQVVIMFH CGSRGFGHQV ATDYLKLFLS VMGRKYGIKI VDRELACAPF HSPEGQAYFS AMKCAVNMAF ANRQVILHRI REVFSDLFHA SPDELGLRMV YDVAHNTAKL ERHEVNGTRK ELLVHRKGST RAFGPGAAGL PGCYAKTGQP VIIGGSMETG SYLLAGMQSG ADAFFTTAHG SGRTMSRHEA KKNFRGDKLQ REMEARGIYV RTDSFGGLAE EAGPAYKNID EVVEATELAG LSKRVARLVP IGNIKG
|
| |