Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3843 |
Symbol | flgI |
ID | 8139217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4427187 |
End bp | 4428383 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644871460 |
Product | flagellar basal body P-ring protein |
Protein accession | YP_003023618 |
Protein GI | 253702429 |
COG category | [N] Cell motility |
COG ID | [COG1706] Flagellar basal-body P-ring protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 0.00648297 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCGTTC TAGAGCATGG TTATTTGAGG GGCGAGGTCG CGGGCGTTGT CAAGAAATCG ACACAGGCGC CGAAGAGTGT TCAGGTGGAA CCAGTGAAAG TTAATCTCGG ATGGAAAAGT TTATTGCTGC TGGTCCTTTT GCTGCTGCCG CAGCTAGCCT TCGGCGCGCG CATCAAGGAC ATCGCGGCGT TCGACGGCGT CAGGGAGAAC CAGCTGATCG GCTACGGCCT CGTGGTCGGC CTGAACGGTT CCGGCGACTC GGACCAGACC AAGTTCCCGG TGCAGTCTCT GGTCGGCGCC TTGGAGCGGA TGGGGATCAC CGTTAACCGC AACGACATCA CGGTGAAGAA CATCGCGTCG GTCATGGTCA CGGCCCAGCT CCCCCCCTTC GCCAAGCAGG GTAACCGGCT CGACGTCCTG GTCTCCTCCA TGGGCGACGC CAAGAGCCTG GCCGGCGGCA CCCTGATGAT GGCCCCCTTG AAGGGTGCGG ACAACCAGGT CTACGCCGTG GCGCAGGGGG CGGTCCTGAC CAACTCCTTC TCCTACGGAG GCCAGGCGGC GAGCGCCATG AAGAACCACC CGACGGCGGG GACGGTCCCG GGGGGGGCGC TCATCGAGCG CGAGATCCCG AACGTCTTGG CCAGCCGCAG CCAACTGAAG CTCAACCTGC ACCAGTCCGA CTTCACCACC GCCTCCCGGG TGGCGAGCGC CATCAACGAG CGCTTTCAGG GACAGGTGGC GACCCTCACC GACCCGGGGA GCGTGCAGAT CGCGGTGCCG GCCGAGTACC GGAACCGGGT GGTCGAATTC GTCGCCAACC TGGAGCGGCT CGAAGTGAAC CCCGACGTAT TGGCGCGGGT GGTGATGAAC GAGCGGACCG GCACCATCGT CATGGGTGAG AACGTCCGTA TCTCGACCGT GGCGGTATCG CACGGCAACC TGACCGTCGT GATCAAGGAG TCCCCCAAGG TCTCCCAGCC GAGGGCTTTG GCCCAGGGGA CCACCACGGT AGTGCCGAGG ACGGAGCTGA GGGTGGCCGA GGAGAAGGTG AACCTATCGA TGGTCAGGGA AGGGGCCAAC CTGGGAGAGG TGGTGCGCGC CCTGAACACC CTGGGGGTAA CGCCCAGGGA CCTGCTCGGC ATCATGCAGG CGATCAAGGC CGCAGGAGCC TTGAACGCCG AGCTGAGCGT GATGTAG
|
Protein sequence | MAVLEHGYLR GEVAGVVKKS TQAPKSVQVE PVKVNLGWKS LLLLVLLLLP QLAFGARIKD IAAFDGVREN QLIGYGLVVG LNGSGDSDQT KFPVQSLVGA LERMGITVNR NDITVKNIAS VMVTAQLPPF AKQGNRLDVL VSSMGDAKSL AGGTLMMAPL KGADNQVYAV AQGAVLTNSF SYGGQAASAM KNHPTAGTVP GGALIEREIP NVLASRSQLK LNLHQSDFTT ASRVASAINE RFQGQVATLT DPGSVQIAVP AEYRNRVVEF VANLERLEVN PDVLARVVMN ERTGTIVMGE NVRISTVAVS HGNLTVVIKE SPKVSQPRAL AQGTTTVVPR TELRVAEEKV NLSMVREGAN LGEVVRALNT LGVTPRDLLG IMQAIKAAGA LNAELSVM
|
| |