Gene GSU3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3047 
SymbolflgI 
ID2686555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3350209 
End bp3351315 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content63% 
IMG OID637127740 
Productflagellar basal body P-ring protein 
Protein accessionNP_954089 
Protein GI39998138 
COG category[N] Cell motility 
COG ID[COG1706] Flagellar basal-body P-ring protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.805391 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAAAC CCATGAAACG AATTTTTGTA GTCTTAGTCA TTCTTCTGGT GCTTCCCCAG 
TTGGCCCTGG CGATCAGGAT CAAGGATATC GCCAGTTTTG ACGGGGTACG GGACAACCAG
CTCATCGGCT ACGGCCTCAT CGTGGGCCTG AACGGTACGG GCGACAGCGA CCAGACCAAG
TTTCCGGTCC AGTCCCTGGC CAACGTGCTG GAGCGGATGG GCATCACCGT GAACCGCGAC
GATATCAAGG TGAAGAACGT GGCCGCGGTC ATGGTGACCG CCGAGCTTCC CCCCTTCTCC
AAGCAGGGGA CCAGAGTGGA CGTGCTCGTC TCATCCCTGG GAGACGCCAA GAGCCTTGCC
GGCGGCACGC TGCTCATGAC CCCTCTCAAG GGAGCCGACG GCCAGGTCTA TGCCGTGGCC
CAGGGAGGTC TGCTCACCAA CTCTTTCTCC TACGGCGGCC AGGCGGCAAC GGCCCAGAAA
AATCACCCCA CGGCCGGCCG GATTCCCAAC GGAGCGCTGG TGGAGCGGGA GCTGCCCAAC
GTCCTGGCGG ATCGGTCGCA ACTGCGGCTC AACCTGCACC AGCCGGATTT CACCACGGCC
ACGCGCATCG CCCGGGCGGT CAACGAACAG TTCAAGGCCG GCGTAGCCAG CTGCAATGAT
CCCGGTTCGG TCGTGATCTC CCTCCCCGAC GCCTATCAAG GACGGGTGGT TGAGTTTGTC
GCCGATATGG AGCGCCTCGA GGTTCGCCCC GATAATCCGG CGAAGGTGGT CCTGAACGAA
CGGACCGGCA CCATCGTCAT CGGCGAGAAC GTCCGCATCG ACACCGTTGC GGTCTCCCAT
GGCAACCTGA CTCTCCTGAT CAAGGAAACG CCGAGGGTTT CCCAACCCCA GCCTCTGAGC
CGCACGGGCG AGACCGTCGT AGTGCCTCGC ACCGGCATCA AGGTTTCCGA GGAGAGCGGC
GGATTGGCCG TGTTGCGCGA AGGTGCCAGC ATCGGTGACG TGGTGCGCGC CCTCAATGCC
CTGGGGGTGA CGCCGCGGGA CCTGATCGGC ATTCTCCAGG CAATCAAGGC TGCCGGGGCC
ATGCAGGCAG AACTGTCGGT CATCTGA
 
Protein sequence
MDKPMKRIFV VLVILLVLPQ LALAIRIKDI ASFDGVRDNQ LIGYGLIVGL NGTGDSDQTK 
FPVQSLANVL ERMGITVNRD DIKVKNVAAV MVTAELPPFS KQGTRVDVLV SSLGDAKSLA
GGTLLMTPLK GADGQVYAVA QGGLLTNSFS YGGQAATAQK NHPTAGRIPN GALVERELPN
VLADRSQLRL NLHQPDFTTA TRIARAVNEQ FKAGVASCND PGSVVISLPD AYQGRVVEFV
ADMERLEVRP DNPAKVVLNE RTGTIVIGEN VRIDTVAVSH GNLTLLIKET PRVSQPQPLS
RTGETVVVPR TGIKVSEESG GLAVLREGAS IGDVVRALNA LGVTPRDLIG ILQAIKAAGA
MQAELSVI