Gene Nmul_A1337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1337 
Symbol 
ID3785063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1526739 
End bp1527905 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content59% 
IMG OID637811425 
Productflagellin-like 
Protein accessionYP_412032 
Protein GI82702466 
COG category[N] Cell motility 
COG ID[COG1344] Flagellin and related hook-associated proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGCAA TCATCAATAC CAACGTCATT TCCATGAATG CGCAGCGTAA CCTCAACGGT 
TCGCAGAACG CACTCGCCAC CACATTGCAG CGCCTGTCCT CCGGTCTGCG CATCAACAGC
GCCAAGGACG ATGCTGCAGG GCTTGCGATC TCGGAACGCA TGACTTCCCA GATCAAAGGC
TTCACTCAGG CAATTCGCAA TGCCAACGAC GGCATCTCGA TGTCTCAGAC GGCGGAGGGC
GCACTGGGGG AAATCGGCAA CAACCTGCAG CGCATACGTG AGCTGGCCGT GCAATCGCGC
AATGCCAGCA ACAGCGCAAG CGACCGTACC GCGCTCAACA ATGAAGTCCA GCAGCTCAAG
GCCGAAATCG ATCGCGTTGC TTCAACCACG ACGTTCAACG GCATCAAACT GCTTGATGGT
ACTTTCACCA ATCAAGACTT TCAGGTGGGC GCCAATGTGG GCGAGACTAT CAATATTGCG
AGTATCGTCA ATGCACAAAG CTCTGCTCTG GGAACCACTA CCACTTACTC AACCACCGTT
ACCGGTGTAG CCGCTACAGG ATTCGCGACG CCAGCGGATG ACATCGCCGC GGGCGACTTG
AAGATAAACG GGGTTGACGT GGGCGCCATT ACCGCAGGAG GCACAGCACC CCTTCAGGGA
GCAGCCGTCG CAGCTGCCAT CAACCTGATT TCGGGAACCA CAGGGGTTAG CGCTTCTGCC
GATGGCGCCG GTCTGGTGAC ACTGACCAGC ACGTCCAGCG ACGGCATCAC TGTGGCCATG
AGTGGTACGG CTAACACCGC ACGGACCGGT CTGACTGCCG GCGCAACCGC AGCAACGGCA
ACGACCGCCG CTGGCTTCGG CGCGCTGAAG ATAGACACCA CCGCCGATGC CGATACCGCG
ATCGCTTCGA TGGATTCCGC ATTGAGCGCG CTCAACGCGG CGCGTGCCGA TCTCGGCGCC
TACCAGAACC GATTCACATC GGCAGTTGCA AACCTTCAGA CTGTCTCCGA GAACCTGTCC
GCCTCGCGCA GCCGCATCAT GGATGCCGAT TTCGCGGCGG AAACGGCGGC GCTTTCGCGC
AACCAAGTGC TGCAACAAGC GGGAACGGCC ATGCTGGCTC AGGCAAATGC AATGCCGCAA
AGCGTATTGT CCCTGCTGAG AGGTTAA
 
Protein sequence
MAAIINTNVI SMNAQRNLNG SQNALATTLQ RLSSGLRINS AKDDAAGLAI SERMTSQIKG 
FTQAIRNAND GISMSQTAEG ALGEIGNNLQ RIRELAVQSR NASNSASDRT ALNNEVQQLK
AEIDRVASTT TFNGIKLLDG TFTNQDFQVG ANVGETINIA SIVNAQSSAL GTTTTYSTTV
TGVAATGFAT PADDIAAGDL KINGVDVGAI TAGGTAPLQG AAVAAAINLI SGTTGVSASA
DGAGLVTLTS TSSDGITVAM SGTANTARTG LTAGATAATA TTAAGFGALK IDTTADADTA
IASMDSALSA LNAARADLGA YQNRFTSAVA NLQTVSENLS ASRSRIMDAD FAAETAALSR
NQVLQQAGTA MLAQANAMPQ SVLSLLRG