Gene Information |
Locus tag | GM21_3652 |
Symbol | |
ID | 8139026 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4229877 |
End bp | 4231046 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871273 |
Product | Baseplate J family protein |
Protein accession | YP_003023431 |
Protein GI | 253702242 |
COG category | [S] Function unknown |
COG ID | [COG3299] Uncharacterized homolog of phage Mu protein gp47 |
TIGRFAM ID | |
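The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a multiple of 3 and that one terminal stop codon
    (if present) does not contribute an amino acid.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(4229877, 4231046)
print(length)                              # 1170
print(expected_protein_length_aa(length))  # 389
```

Both results agree with the Gene Length (1170 bp) and Protein Length (389 aa) fields in the record.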
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 174 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCAATCC AAGATCTGGT CTCTAAATCG CTCGATACCA TCCGCCAAGA AATGTTTGAC CGCATTGCGG CGGTCCAGGA TGAATATTCC GCCAAGGGGT GGCTCCCCAT CCGGTTGAAC CTGAACAAGG GGATCGTGCG CGGCATGATC GAGCTCTGGT GCTGGGGTCT ATGGCAGCTG TACCAGTTCC TCGCCCTGGT GCTGAAACAA GCCTTTCCGG ACACTGCCAC CGGGGTCTGG CTAGACCTGC ATTGCAAACA AGTCGGCGTT GCTCGGAGAG AGGCCACCAA GGCGGTCGGC GTCGTCTATT TCACACGTGC CGGCATGGTC GGAAACGTCC CGATTCCCGC CGGGCGCGTG GTCCGCACCA AGCCGGACGG CAACGGCCTT ATATATAGAT ACGTGACCAC GGCAGCGGCG GTGCTTTTGA ACGGGGCGAC CGAGGTGGCC GTGGCGGTCG AGGCGGAAGA ATACGGCGCG GCCGCCAACG CGACGGTCGG GCAGATCTCC GAGATCGTGA CGGTGATCCC CGGCGTCGAC GCGGTGGAGA ATAGGGCCGA CTGGATCACC AGAGAGGGGA GCGACCAAGA GAAGGACGAG AGCCTCCGCG AGCGCTACCA GCTGGCCTGG AAGGTGCTGA ACGGCTGCAC CAAGTACGCC TATGAGGCAT GGGCCAAAGA AGTGGTTGGC GTAGTCGCGG TCAAGATCAG GGACCAGCAC CCCCGGGGCG AGGGGACGGT CGACGTGGTC ATAGTGGGGA GCGCCGGCGC GCCGACTCCG GCATTGCTTG CCTCAGTCGA TGCCAACATC AACGGCACGG GGAACGACGA CGAGAAGAAC CCGATCAACG ACGATGTGCT GGTAGCCGGC GCTGACCTAG TGGCCACCAG CCTCGTCGCG CAGCTGGAGC TCAGCTACGG CGACCCGGCT GCGCTTCTGC TTGAGGCGGA AAACCGGGTG CGGGCGCTTT ACTCCACAGT GGCCTCTGTT GCCGGGGTCG TGCCCTTCGG GATCGGCGGG GACGTGACGC GCGACCGGCT GGTGTGGGCG ATGATGCTGC CCGGCGTAAA GCGGGTCAAC ATGGTGTTCG CGGACGTGGC GGTACCGGAG TACGGGCTTG CCACATTGAC CGATCTCACC CTCACCTACG TGCTGGCCGC AGAGGCATAA
Protein sequence | MSIQDLVSKS LDTIRQEMFD RIAAVQDEYS AKGWLPIRLN LNKGIVRGMI ELWCWGLWQL YQFLALVLKQ AFPDTATGVW LDLHCKQVGV ARREATKAVG VVYFTRAGMV GNVPIPAGRV VRTKPDGNGL IYRYVTTAAA VLLNGATEVA VAVEAEEYGA AANATVGQIS EIVTVIPGVD AVENRADWIT REGSDQEKDE SLRERYQLAW KVLNGCTKYA YEAWAKEVVG VVAVKIRDQH PRGEGTVDVV IVGSAGAPTP ALLASVDANI NGTGNDDEKN PINDDVLVAG ADLVATSLVA QLELSYGDPA ALLLEAENRV RALYSTVASV AGVVPFGIGG DVTRDRLVWA MMLPGVKRVN MVFADVAVPE YGLATLTDLT LTYVLAAEA
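As an illustration of how the gene and protein sequences relate, the following minimal sketch translates a DNA string codon by codon and computes its GC fraction. It assumes the gene sequence is given 5' to 3' on the coding strand, as displayed, and it builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). All names here are hypothetical helpers, and only the first 30 bases of the record are used to keep the example short.

```python
# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the first base until a stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
fragment = "ATGTCAATCC AAGATCTGGT CTCTAAATCG"
print(translate(fragment))  # MSIQDLVSKS
print(gc_fraction(fragment))
```

The translated fragment matches the first ten residues of the listed protein sequence. The GC fraction of this short fragment is not expected to match the 64% reported for the whole gene; passing the full gene sequence to `gc_fraction` would be the way to check that field.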