Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3913 |
Symbol | |
ID | 8139287 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4499043 |
End bp | 4499927 |
Gene Length | 885 bp |
Protein Length | 294 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644871530 |
Product | protein of unknown function DUF1078 domain protein |
Protein accession | YP_003023688 |
Protein GI | 253702499 |
COG category | [N] Cell motility |
COG ID | [COG4786] Flagellar basal body rod protein |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 0.0335903 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGTAA CTTCCGCACT GTACACCGGC GTCAGCGGCC TGCTTCAAAA CGGCGAGGCC ATGAACGTGA TCGGCAACAA CATATCCAAC GTCAACACCA TAGGCTACAA GGGTGCGCGC ACCCTCTTCT CGGACATGCT TTCCCAGAAC GTCGGCAACA ACTCGCAGAT CGGCAAAGGG GTGCAGATGC AGGTGGTGCA GAACGTCTTC TCCCAGGGCT CCACCCAGTC CTCCGAGAAC GTCACCGACC TCGCCATCCA GGGGAACAGC TTCTTCGCCC TGAAGCCCTC GACCGCGGCC GCCCCGGTGG CAAGGCAAAG CGACGCCTTC CTCTCCCGCG CCGGCGCCTT CCAAACCGAC AGCAACCTCT ACCTGGTGAA CCCGGACGGC TACCAGGTGC TCGACACCGC CGGCAACCCG ATCCAGTTCC TCGACACCAA AGCCGCCCCC ACCACAGACT TCGGCAAGGT GCTGAGCATC GACAACACCG GCCTGATCAC CTACCTCGCC ACCGACGGGA TCACCCAGAA CTACTACAGC GCCTCTGGCG CGGTAGGGGT CGCCACCACC CCCGCCGCCG CGACCGCCGC GGAGAAGATC GCCGTGGTCA CCGCTGCTGA CACCACCGGG CTGAAGAAGG TCGGCGGCTC GCTGTACCAG GCGACCACCG ATGCCGGCGT CTCGACCGCG GCCTTCTCCC AGGCAGCCAA CAAGCCCAAC GGGGTCAGCG AGACCATCCT CTCCAACACC CTGGAGCAGA GCACCGTCGA CCTGGCGAAC GAGTTCGTCA AGATGATCAC CACCCAGAGG GCCTACTCGG CCAACTCCAA GACCATCACC ACCTCCGACC AGATGACGCA GGAAGTGCTG AACCTGATCC GTTAA
|
Protein sequence | MSVTSALYTG VSGLLQNGEA MNVIGNNISN VNTIGYKGAR TLFSDMLSQN VGNNSQIGKG VQMQVVQNVF SQGSTQSSEN VTDLAIQGNS FFALKPSTAA APVARQSDAF LSRAGAFQTD SNLYLVNPDG YQVLDTAGNP IQFLDTKAAP TTDFGKVLSI DNTGLITYLA TDGITQNYYS ASGAVGVATT PAAATAAEKI AVVTAADTTG LKKVGGSLYQ ATTDAGVSTA AFSQAANKPN GVSETILSNT LEQSTVDLAN EFVKMITTQR AYSANSKTIT TSDQMTQEVL NLIR
|
| |