Gene Information |
Locus tag | GM21_0736 |
Symbol | |
ID | 8136051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 879255 |
End bp | 880253 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644868353 |
Product | protein of unknown function DUF534 |
Protein accession | YP_003020568 |
Protein GI | 253699379 |
COG category | [R] General function prediction only |
COG ID | [COG2984] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
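The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene ends in a single stop codon that is not counted in the protein length; the function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(879255, 880253)
print(gene_len, protein_length_aa(gene_len))  # prints: 999 332
```

Both values match the Gene Length (999 bp) and Protein Length (332 aa) rows of the record.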
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.00000000204583 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCAAGA ATTTTCGTAG CGTGATGCTC TGTTTCGCTT TGTCCCTGGT CTGCGCCGCA ACGGCTTTCG CGGCGGCCCC CTCCAAGCCG GTGCTGATCG GCATCTCGAA GATAGTTTCC CATCCGGCGC TCGACTCGGT GGTCAAGGGG GTTCAGGACG AATTGAAGGA CGCCAGGGTC AACGCGATCT TCGACGTGCA AAACGCCAAC GGCGACATCA ACACCGCGGC CTCCATCGCC AACAAGTTCC GGTCCCAGAA GGTGAACCTC GCCGTCGGCG TCGCCACTCC GACCGCCCAG GCCCTGGTTA ATACGCTCAA GGGGATCCCC ATCGTCTACT CCGCGGTCAC CGATCCGGTG AAGGCGGGCC TCGTTCCCTC CCTCGCCAAG GGGGGCAAGA ACGTAACCGG CGTATCCGAC ATGACTCCGG TCCGGCAGCA GATCGAGATG CTGCTCAGGA TCAAGCCCAA GACCAAGCGC ATCGGCCACA TCTACACGAG CTCCGAGGAG AACGCCGTGG TTCTTGCCGC AATGGTGAAG CAGGTGTGCA AAGAGAAGAA GCTCGAATTC GTGGAGACCA CCGTCACCAA GTCGGCAGAG GTGAAGCAGG CGGCCCAGGC GATCGCGCAC CGCGTCGACG CCTTTTACAT CAGCACCGAC AACACCGTGG TCTCCGCCAT GAGCGCGGTG GCGGATGTGG CGAAAAAGGC GAAGATCCCC ATCATGTCCG CCGACCCGAG CTCCTCCGAG ACCTATGACG TCCTCGCCGC CTGGGGCTTC GACTACTACA AGATGGGGCG CGCCACCGGC AAGGTTGTGA TCGAGATCCT GAAGGGCAAG AAGCCCGAGC AGATCCCGAC CCGCTTCATG ACCAAGGCCT CCGACGTCGA CCTGCTGATC AACCTCGACG TGGCCAAGAA GCTCGGCCTC ACCGTCCCGG CGGACATCGT GAAGAGCGCG AAGACCATAC GCCAGAACGG CAAATTGACC AAGAAGTAA
|
Protein sequence | MSKNFRSVML CFALSLVCAA TAFAAAPSKP VLIGISKIVS HPALDSVVKG VQDELKDARV NAIFDVQNAN GDINTAASIA NKFRSQKVNL AVGVATPTAQ ALVNTLKGIP IVYSAVTDPV KAGLVPSLAK GGKNVTGVSD MTPVRQQIEM LLRIKPKTKR IGHIYTSSEE NAVVLAAMVK QVCKEKKLEF VETTVTKSAE VKQAAQAIAH RVDAFYISTD NTVVSAMSAV ADVAKKAKIP IMSADPSSSE TYDVLAAWGF DYYKMGRATG KVVIEILKGK KPEQIPTRFM TKASDVDLLI NLDVAKKLGL TVPADIVKSA KTIRQNGKLT KK
|
| |
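The protein sequence can be derived from the gene sequence with translation table 11, and the GC content can be recomputed from the same string. The following is a minimal sketch rather than the database's own pipeline: it uses only the Python standard library, builds the standard codon-to-amino-acid mapping (which table 11 shares with table 1), and assumes that an alternative start codon such as GTG in the first position is read as methionine. The helper names (`clean`, `translate`, `gc_percent`) and the `START_CODONS` set are illustrative.

```python
# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons assumed for table 11; an initiating codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon, e.g. GTG, is translated as Met
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence listed above.
print(translate("GTGAGCAAGA ATTTTCGTAG CGTGATGCTC"))  # prints: MSKNFRSVML
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` allows the Protein sequence and GC content rows to be checked in the same way.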