Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1324 |
Symbol | flgE |
ID | 3783945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1513087 |
End bp | 1514307 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637811412 |
Product | flagellar hook protein FlgE |
Protein accession | YP_412019 |
Protein GI | 82702453 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTTTC AGCAGGGTTT AAGTGGTTTG AATGCAGCAT CGAAAAACCT CGATGTCATC GGTAATAATG TGGCCAATAC GAATACTGTA GGCTTCAAGC AATCACAGGC ACAGTTTGCG GATATGTTCG CCAATTCGCT GTCGGGCGGT GGTGGCACGC AGGCCGGCAT AGGTGTCAAG CTGGCCGGGA TTGCTCAGCA GTTCAGCCAG GGAAGTATTA CCGTATCCAA CAATCCATTC GACATCGCCA TCAGCGGTGC AGGTTTCTAC CGTTTAAGCG ATCAGGGCAC GATTAGCTAT TCGCGCAATG GCCAGTTTCA TCCGGACAAG GATGGTTATA TCGTCAACAG CAGTGGCCTC CGGCTTACCG GCTATATGGC CAACTCGACC GGACAGATCA ATACCGGCAC GCCTACCGAT TTAAGGCTTT CCACTGCCGA CCTGCCACCT GTCACCACGA CGCGGGTGAA CGCATTGGTC AATCTCGATT CGCGGGGGGC GCCGTTGAGT GCGGCCGCCT TCGATCTGAT GGATCCGGCG ACTTATCACA GCTCAACCTC GCTTTCGGTC TATGACAGCC TGGGTAATTC GACTCCGTTG TCGACATATT TCGTCAAGAC GGCAGCCAAT AGCTGGGATG TATTCGCTGC CAACAATGGC TCCCTCCTCA ATGGCGGGCT GTCGATTGGC ACATTGAATT TCCTGTCCAA CGGTAGCCTC GACCCCTTGA GCTCCAGCAG CTTCAATGTG ACGGCTCCCG TCACTACCGG GGCCAGTCCC CTCGCTTTCG ATATCGATTT CGCAAACACG ACCCAGTTTG GTTCCAATTT CGGTATCAAT GCGTTATCGC AGGACGGGTA CGCATCCGGG CAGCTCACCG GTTTTTCCAT CGGCGAGGAT GGGATCGTAA GCGGCAGCTA TTCCAACGGT AAATTTCTCT CAATGGGGCA GATCGCGCTG GCCAATTTCG CCAATCCCCA AGGTTTGCAA GCAGTCGGCA ATAATACCTG GAAAGAAAGC GCCGCTTCAG GCGCTGCCCT CGTAGCGGCG CCCGCCACCG GGGGCCTGGG TGTGCTACAG GCGGGCGCGG TTGAGGATTC AAACGTGGAG CTTACCTCGG AGCTCGTCAA TATGATTACG GCCCAGCGTG TTTACCAAGC CAATGCCCAG ACGATCAAGA CTCAGGATCA GATACTTCAG ACAGTGGTGA ACCTGAAGTA A
|
Protein sequence | MSFQQGLSGL NAASKNLDVI GNNVANTNTV GFKQSQAQFA DMFANSLSGG GGTQAGIGVK LAGIAQQFSQ GSITVSNNPF DIAISGAGFY RLSDQGTISY SRNGQFHPDK DGYIVNSSGL RLTGYMANST GQINTGTPTD LRLSTADLPP VTTTRVNALV NLDSRGAPLS AAAFDLMDPA TYHSSTSLSV YDSLGNSTPL STYFVKTAAN SWDVFAANNG SLLNGGLSIG TLNFLSNGSL DPLSSSSFNV TAPVTTGASP LAFDIDFANT TQFGSNFGIN ALSQDGYASG QLTGFSIGED GIVSGSYSNG KFLSMGQIAL ANFANPQGLQ AVGNNTWKES AASGAALVAA PATGGLGVLQ AGAVEDSNVE LTSELVNMIT AQRVYQANAQ TIKTQDQILQ TVVNLK
|
| |