Gene Nmul_A1324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1324 
SymbolflgE 
ID3783945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1513087 
End bp1514307 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content55% 
IMG OID637811412 
Productflagellar hook protein FlgE 
Protein accessionYP_412019 
Protein GI82702453 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTTTC AGCAGGGTTT AAGTGGTTTG AATGCAGCAT CGAAAAACCT CGATGTCATC 
GGTAATAATG TGGCCAATAC GAATACTGTA GGCTTCAAGC AATCACAGGC ACAGTTTGCG
GATATGTTCG CCAATTCGCT GTCGGGCGGT GGTGGCACGC AGGCCGGCAT AGGTGTCAAG
CTGGCCGGGA TTGCTCAGCA GTTCAGCCAG GGAAGTATTA CCGTATCCAA CAATCCATTC
GACATCGCCA TCAGCGGTGC AGGTTTCTAC CGTTTAAGCG ATCAGGGCAC GATTAGCTAT
TCGCGCAATG GCCAGTTTCA TCCGGACAAG GATGGTTATA TCGTCAACAG CAGTGGCCTC
CGGCTTACCG GCTATATGGC CAACTCGACC GGACAGATCA ATACCGGCAC GCCTACCGAT
TTAAGGCTTT CCACTGCCGA CCTGCCACCT GTCACCACGA CGCGGGTGAA CGCATTGGTC
AATCTCGATT CGCGGGGGGC GCCGTTGAGT GCGGCCGCCT TCGATCTGAT GGATCCGGCG
ACTTATCACA GCTCAACCTC GCTTTCGGTC TATGACAGCC TGGGTAATTC GACTCCGTTG
TCGACATATT TCGTCAAGAC GGCAGCCAAT AGCTGGGATG TATTCGCTGC CAACAATGGC
TCCCTCCTCA ATGGCGGGCT GTCGATTGGC ACATTGAATT TCCTGTCCAA CGGTAGCCTC
GACCCCTTGA GCTCCAGCAG CTTCAATGTG ACGGCTCCCG TCACTACCGG GGCCAGTCCC
CTCGCTTTCG ATATCGATTT CGCAAACACG ACCCAGTTTG GTTCCAATTT CGGTATCAAT
GCGTTATCGC AGGACGGGTA CGCATCCGGG CAGCTCACCG GTTTTTCCAT CGGCGAGGAT
GGGATCGTAA GCGGCAGCTA TTCCAACGGT AAATTTCTCT CAATGGGGCA GATCGCGCTG
GCCAATTTCG CCAATCCCCA AGGTTTGCAA GCAGTCGGCA ATAATACCTG GAAAGAAAGC
GCCGCTTCAG GCGCTGCCCT CGTAGCGGCG CCCGCCACCG GGGGCCTGGG TGTGCTACAG
GCGGGCGCGG TTGAGGATTC AAACGTGGAG CTTACCTCGG AGCTCGTCAA TATGATTACG
GCCCAGCGTG TTTACCAAGC CAATGCCCAG ACGATCAAGA CTCAGGATCA GATACTTCAG
ACAGTGGTGA ACCTGAAGTA A
 
Protein sequence
MSFQQGLSGL NAASKNLDVI GNNVANTNTV GFKQSQAQFA DMFANSLSGG GGTQAGIGVK 
LAGIAQQFSQ GSITVSNNPF DIAISGAGFY RLSDQGTISY SRNGQFHPDK DGYIVNSSGL
RLTGYMANST GQINTGTPTD LRLSTADLPP VTTTRVNALV NLDSRGAPLS AAAFDLMDPA
TYHSSTSLSV YDSLGNSTPL STYFVKTAAN SWDVFAANNG SLLNGGLSIG TLNFLSNGSL
DPLSSSSFNV TAPVTTGASP LAFDIDFANT TQFGSNFGIN ALSQDGYASG QLTGFSIGED
GIVSGSYSNG KFLSMGQIAL ANFANPQGLQ AVGNNTWKES AASGAALVAA PATGGLGVLQ
AGAVEDSNVE LTSELVNMIT AQRVYQANAQ TIKTQDQILQ TVVNLK