Gene GM21_3843 details

Gene Information

Locus tag: GM21_3843
Symbol: flgI
ID: 8139217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4427187
End bp: 4428383
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 65%
IMG OID: 644871460
Product: flagellar basal body P-ring protein
Protein accession: YP_003023618
Protein GI: 253702429
COG category: [N] Cell motility
COG ID: [COG1706] Flagellar basal-body P-ring protein
TIGRFAM ID:
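
The listed lengths can be cross-checked against the coordinates. The sketch below is only an illustration, assuming that the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The stop codon does not encode an amino acid (assumption stated above).
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(4427187, 4428383)
print(length)                     # 1197
print(protein_length_aa(length))  # 398
```

Both results agree with the Gene Length and Protein Length fields above.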


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value: 0.00648297
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCGTTC TAGAGCATGG TTATTTGAGG GGCGAGGTCG CGGGCGTTGT CAAGAAATCG 
ACACAGGCGC CGAAGAGTGT TCAGGTGGAA CCAGTGAAAG TTAATCTCGG ATGGAAAAGT
TTATTGCTGC TGGTCCTTTT GCTGCTGCCG CAGCTAGCCT TCGGCGCGCG CATCAAGGAC
ATCGCGGCGT TCGACGGCGT CAGGGAGAAC CAGCTGATCG GCTACGGCCT CGTGGTCGGC
CTGAACGGTT CCGGCGACTC GGACCAGACC AAGTTCCCGG TGCAGTCTCT GGTCGGCGCC
TTGGAGCGGA TGGGGATCAC CGTTAACCGC AACGACATCA CGGTGAAGAA CATCGCGTCG
GTCATGGTCA CGGCCCAGCT CCCCCCCTTC GCCAAGCAGG GTAACCGGCT CGACGTCCTG
GTCTCCTCCA TGGGCGACGC CAAGAGCCTG GCCGGCGGCA CCCTGATGAT GGCCCCCTTG
AAGGGTGCGG ACAACCAGGT CTACGCCGTG GCGCAGGGGG CGGTCCTGAC CAACTCCTTC
TCCTACGGAG GCCAGGCGGC GAGCGCCATG AAGAACCACC CGACGGCGGG GACGGTCCCG
GGGGGGGCGC TCATCGAGCG CGAGATCCCG AACGTCTTGG CCAGCCGCAG CCAACTGAAG
CTCAACCTGC ACCAGTCCGA CTTCACCACC GCCTCCCGGG TGGCGAGCGC CATCAACGAG
CGCTTTCAGG GACAGGTGGC GACCCTCACC GACCCGGGGA GCGTGCAGAT CGCGGTGCCG
GCCGAGTACC GGAACCGGGT GGTCGAATTC GTCGCCAACC TGGAGCGGCT CGAAGTGAAC
CCCGACGTAT TGGCGCGGGT GGTGATGAAC GAGCGGACCG GCACCATCGT CATGGGTGAG
AACGTCCGTA TCTCGACCGT GGCGGTATCG CACGGCAACC TGACCGTCGT GATCAAGGAG
TCCCCCAAGG TCTCCCAGCC GAGGGCTTTG GCCCAGGGGA CCACCACGGT AGTGCCGAGG
ACGGAGCTGA GGGTGGCCGA GGAGAAGGTG AACCTATCGA TGGTCAGGGA AGGGGCCAAC
CTGGGAGAGG TGGTGCGCGC CCTGAACACC CTGGGGGTAA CGCCCAGGGA CCTGCTCGGC
ATCATGCAGG CGATCAAGGC CGCAGGAGCC TTGAACGCCG AGCTGAGCGT GATGTAG
 
Protein sequence
MAVLEHGYLR GEVAGVVKKS TQAPKSVQVE PVKVNLGWKS LLLLVLLLLP QLAFGARIKD 
IAAFDGVREN QLIGYGLVVG LNGSGDSDQT KFPVQSLVGA LERMGITVNR NDITVKNIAS
VMVTAQLPPF AKQGNRLDVL VSSMGDAKSL AGGTLMMAPL KGADNQVYAV AQGAVLTNSF
SYGGQAASAM KNHPTAGTVP GGALIEREIP NVLASRSQLK LNLHQSDFTT ASRVASAINE
RFQGQVATLT DPGSVQIAVP AEYRNRVVEF VANLERLEVN PDVLARVVMN ERTGTIVMGE
NVRISTVAVS HGNLTVVIKE SPKVSQPRAL AQGTTTVVPR TELRVAEEKV NLSMVREGAN
LGEVVRALNT LGVTPRDLLG IMQAIKAAGA LNAELSVM
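
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be collapsed into a plain string and checked is shown below. The helper names are hypothetical, and the GC calculation assumes an unambiguous A/C/G/T sequence. Only the first line of the gene sequence is used here. Applying the same functions to the complete gene sequence would allow its length and GC fraction to be compared with the Gene Length and GC content fields.

```python
def normalize_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGCCGTTC TAGAGCATGG TTATTTGAGG GGCGAGGTCG CGGGCGTTGT CAAGAAATCG"
seq = normalize_sequence(first_line)

print(len(seq))                       # 60 (six blocks of ten bases)
print(round(gc_fraction(seq) * 100))  # GC percentage of this line only
```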