Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2032 |
Symbol | |
ID | 2686064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2224395 |
End bp | 2225450 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637126723 |
Product | type IV pilus biogenesis protein PilM |
Protein accession | NP_953081 |
Protein GI | 39997130 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4972] Tfp pilus assembly protein, ATPase PilM |
TIGRFAM ID | [TIGR01174] cell division protein FtsA [TIGR01175] type IV pilus assembly protein PilM |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTTTCT CCAAAAAGAA AGAAATCGTC GGCATCGATA TCGGCTCCAG CTCCGTGAAG CTCGTTCAGC TCAAGGAGCA GAAGGGGGGA TGGCAGCTGG TCAATATCGG CATCCAGCCC CTGCCTCCCG AGGCCATTGT CGATAACACT CTCATGGATA GTTCATCCGT GATCGAGGCG GTCAAGGGGC TCATGAAGGG GCTCTCCGTC AAGGTCAAGG ATGTGGCGTG CTCCATTTCC GGGAACACGG TCATCATCCG CAAGATCAAG CTCCCGGCCA TGACCCCGGA AGAACTGGAA GACCAGATCC AGTGGGAAGC GGAGCAGTAC ATCCCCTTCG ATATCAACGA TGTGAACATT GACTTCCAGA TTCTGGAGCC CGACGAGGAC GACCCGTCCC GCATGAATGT CCTTCTCGTG GCGAGCAAGA AGGAAATCAT CAACGATTAC GTCAATGTGT TTGCCGAGAC AGGTCTCAAA CTCGTGATCG TCGATGTGGA TTCCTTTGCC GTCCAGAACG CCTTCGAACT CAACTACGAA ACCGATCCCG AAGAGGTTGT GGCGCTTATC AATGTCGGCG CAAGTATTCT GAACCTCAAT ATCGTCAGGG GAGGGAGCTC CCTCTTTACC CGGGATGTAC AGGTGGGCGG AAATCTCTTC ACTGAGGAGA TCCAGAAGCA GTTCGCCTTG AGCAGCGAAG AAGCCGAGCA GGTAAAGATC ACCGGCGAGT ACCCCGACAA GGCCAAGCTG AAGGATGTCA TCGCCCGCGT TAACGAAACC CTTGCCGTGG AGATGCGCCG TTCGCTCGAT TTTTACAACA CAACCGCGGG CGAAGGGCGA ATCGCCAGGG TATACCTGAG TGGAGGCGCA GCCAAGACCG CCATGCTTGC TGAAACCGTG CAGAACAAGC TGGGCGTTCC GGTTGAGATG CTGGACCCAC TTACGAAAAT CACCTGCAGC GAGAAAGAGT TCGATCCCGA GTACCTTCGG GAGATCGGGC CGCTCGTGAC GGTTGCCGTG GGGCTGGCCA CGAGGAGGGT GGGTGACAAA TGGTAA
|
Protein sequence | MLFSKKKEIV GIDIGSSSVK LVQLKEQKGG WQLVNIGIQP LPPEAIVDNT LMDSSSVIEA VKGLMKGLSV KVKDVACSIS GNTVIIRKIK LPAMTPEELE DQIQWEAEQY IPFDINDVNI DFQILEPDED DPSRMNVLLV ASKKEIINDY VNVFAETGLK LVIVDVDSFA VQNAFELNYE TDPEEVVALI NVGASILNLN IVRGGSSLFT RDVQVGGNLF TEEIQKQFAL SSEEAEQVKI TGEYPDKAKL KDVIARVNET LAVEMRRSLD FYNTTAGEGR IARVYLSGGA AKTAMLAETV QNKLGVPVEM LDPLTKITCS EKEFDPEYLR EIGPLVTVAV GLATRRVGDK W
|
| |