Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A1337 |
Symbol | |
ID | 3785063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1526739 |
End bp | 1527905 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637811425 |
Product | flagellin-like |
Protein accession | YP_412032 |
Protein GI | 82702466 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCAA TCATCAATAC CAACGTCATT TCCATGAATG CGCAGCGTAA CCTCAACGGT TCGCAGAACG CACTCGCCAC CACATTGCAG CGCCTGTCCT CCGGTCTGCG CATCAACAGC GCCAAGGACG ATGCTGCAGG GCTTGCGATC TCGGAACGCA TGACTTCCCA GATCAAAGGC TTCACTCAGG CAATTCGCAA TGCCAACGAC GGCATCTCGA TGTCTCAGAC GGCGGAGGGC GCACTGGGGG AAATCGGCAA CAACCTGCAG CGCATACGTG AGCTGGCCGT GCAATCGCGC AATGCCAGCA ACAGCGCAAG CGACCGTACC GCGCTCAACA ATGAAGTCCA GCAGCTCAAG GCCGAAATCG ATCGCGTTGC TTCAACCACG ACGTTCAACG GCATCAAACT GCTTGATGGT ACTTTCACCA ATCAAGACTT TCAGGTGGGC GCCAATGTGG GCGAGACTAT CAATATTGCG AGTATCGTCA ATGCACAAAG CTCTGCTCTG GGAACCACTA CCACTTACTC AACCACCGTT ACCGGTGTAG CCGCTACAGG ATTCGCGACG CCAGCGGATG ACATCGCCGC GGGCGACTTG AAGATAAACG GGGTTGACGT GGGCGCCATT ACCGCAGGAG GCACAGCACC CCTTCAGGGA GCAGCCGTCG CAGCTGCCAT CAACCTGATT TCGGGAACCA CAGGGGTTAG CGCTTCTGCC GATGGCGCCG GTCTGGTGAC ACTGACCAGC ACGTCCAGCG ACGGCATCAC TGTGGCCATG AGTGGTACGG CTAACACCGC ACGGACCGGT CTGACTGCCG GCGCAACCGC AGCAACGGCA ACGACCGCCG CTGGCTTCGG CGCGCTGAAG ATAGACACCA CCGCCGATGC CGATACCGCG ATCGCTTCGA TGGATTCCGC ATTGAGCGCG CTCAACGCGG CGCGTGCCGA TCTCGGCGCC TACCAGAACC GATTCACATC GGCAGTTGCA AACCTTCAGA CTGTCTCCGA GAACCTGTCC GCCTCGCGCA GCCGCATCAT GGATGCCGAT TTCGCGGCGG AAACGGCGGC GCTTTCGCGC AACCAAGTGC TGCAACAAGC GGGAACGGCC ATGCTGGCTC AGGCAAATGC AATGCCGCAA AGCGTATTGT CCCTGCTGAG AGGTTAA
|
Protein sequence | MAAIINTNVI SMNAQRNLNG SQNALATTLQ RLSSGLRINS AKDDAAGLAI SERMTSQIKG FTQAIRNAND GISMSQTAEG ALGEIGNNLQ RIRELAVQSR NASNSASDRT ALNNEVQQLK AEIDRVASTT TFNGIKLLDG TFTNQDFQVG ANVGETINIA SIVNAQSSAL GTTTTYSTTV TGVAATGFAT PADDIAAGDL KINGVDVGAI TAGGTAPLQG AAVAAAINLI SGTTGVSASA DGAGLVTLTS TSSDGITVAM SGTANTARTG LTAGATAATA TTAAGFGALK IDTTADADTA IASMDSALSA LNAARADLGA YQNRFTSAVA NLQTVSENLS ASRSRIMDAD FAAETAALSR NQVLQQAGTA MLAQANAMPQ SVLSLLRG
|
| |