Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_4104 |
Symbol | |
ID | 8139478 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 4685786 |
End bp | 4686772 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644871719 |
Product | hypothetical protein |
Protein accession | YP_003023877 |
Protein GI | 253702688 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 119 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCC TGCAGATCAG TTACCTGCTG GCCGCGCTGT CAGTCCCCGT GGTGCTCCAT TTTCATCTCC TTCCCGCCGC GTTCGCGGGG CTCGCGGTGT ACGTGCTGAC CGCGAAGTTC GCCCCCATGC TTCCGCTGCG CTGGGGGAGC CTGACCCAGA AGGTCGCCCT GGCGGGGATC ATCCTGTTCG TCATCGCCCT GGTCTCCAGC ATCTGCCTCG GGCTCTGGTC GTTTTTGCGC GGCCACCACG GGATGGCGAA CCTGCTCAAC ATGGCGGCCG AAACCCTGGA AAACCTGAAG CGGACCCTCC CGGAGGACCT CACCGCGGCG CTACCCGACA CCGTCGAGGA GCTGCGCGAG CAGATAACCA ACATGCTGCG GGAGCATGGG AGGAATATCT CCTCCGTCGG CATCTCCGGA GTCAAGACCT TCGCCCACCT AGTCCTCGGC ATGGTGGTGG GAAGCCTCGC GGTGCTGCAC CGCTTCAACA AGGAAAACGG GTTCCCCCCC CTGGCGTCCG ACCTGCACGC GAGACTGATC AACCTGGCCG ATTCCTTCGA CAAGGTGGTC TTCGCCCAGG TGAAGATCTC CGCGCTCAAC ACCGTTCTCA CCGCGATCTA CCTCGCCCTG GTGCTCCCCA TTTTCGGGAT CTACCTCCCC ATGGTCACCC TCCTGGTCCT GCTCACCTTC GTGGCCGGCC TGCTCCCGGT GGTGGGTAAC CTCATCTCCA ACTTCACCAT CGTGCTGATC AGCCTCGGCG TCTCCCCCAT GGTGGGTGTC GCCTCGCTGG CGTTCCTGAT CGTGATTCAC AAGCTGGAGT ACTTCACCAA CGCGAGGATC GTGGGGGGCG AGGTGAAGGC CAGCGTCTGG GAGCTTTTGT GCGCCATGCT CTTCATGGAA GCGATCTTCG GCATGGCCGG TCTGGTGGCC GCCCCCGTGG TTTACGCCTG GCTCAAGGCC GAACTCAAGG CGAAGCGGCT GGTATAA
|
Protein sequence | MNTLQISYLL AALSVPVVLH FHLLPAAFAG LAVYVLTAKF APMLPLRWGS LTQKVALAGI ILFVIALVSS ICLGLWSFLR GHHGMANLLN MAAETLENLK RTLPEDLTAA LPDTVEELRE QITNMLREHG RNISSVGISG VKTFAHLVLG MVVGSLAVLH RFNKENGFPP LASDLHARLI NLADSFDKVV FAQVKISALN TVLTAIYLAL VLPIFGIYLP MVTLLVLLTF VAGLLPVVGN LISNFTIVLI SLGVSPMVGV ASLAFLIVIH KLEYFTNARI VGGEVKASVW ELLCAMLFME AIFGMAGLVA APVVYAWLKA ELKAKRLV
|
| |