Gene Information |
Locus tag | Noca_0766 |
Symbol | |
ID | 4599651 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 810073 |
End bp | 811203 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639775367 |
Product | flagellin domain-containing protein |
Protein accession | YP_921978 |
Protein GI | 119715013 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
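The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 810073
end_bp = 811203
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS includes one terminal stop codon,
# so the protein has (number of codons - 1) residues.
codons, remainder = divmod(span, 3)
expected_aa = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("codons - 1 matches Protein Length:", expected_aa == protein_length_aa)
```

Under these assumptions the three fields agree with one another: 1131 bp corresponds to 377 codons, one of which is the stop codon.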
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCTCC GCATCAACCA GAACATCGAC GCCCTCAACT CCTACCGCAA CCTGTCGGTC ACCCAGGGCC AGATGAGCAA GTCGCTGGAG AAGCTGTCCA GCGGTTACCG GATCAACCGT GCGGCCGACG ACGCCGCCGG CCTCGCGATC TCCGAGGGCC TGCGCTCGCA GGTCGGCGGC ATCAAGGTCG CGGTCCGCAA CGCCCAGGAC GGCGTCAGCG TCGTCCAGAC CGCTGAAGGT GCGCTGACCG AGACCCACTC GATCCTGCAG CGCATGCGCG ACCTGTCGGT GCAGGCCGCC AACGACAGCA ACGACGTGAG CTCGCGGGCA GCCATCCAGT CCGAGAACGA CGCTCTCGTG GACGAGCTGG ACCGGATCGC GACGTCGACG ACGTTCAACA ACGTCTCCCT CCTGGACGGC AACTACATCG GCAAGAACTT CCAGGTCGGC TACGGCACCG GGGCCACGGA CTCGATCGCG GTGGACATCA CCTCGACCGG CACCGGCGGC GCGAAGTCGA CGTGGGCCAA CGGTGCGGCG TCGACCACGG CGGCGGCCGC GACCTTCACC CACAACGGTG TCGCGACCAC GACCGGCGTG CTCGTCGCCT CGACCGACGC CAACAACATC GCCACCCAGC TGAACGCGGA CGCCAACTTC AAGACCAACT ACACCGCGTC GGTCGACGAC AAGGGTGGCC TGGTCGTCAC GTCGAAGGAC GGTCTCCCGG GTGCGATCAC CGTGTCCGGT GCCGGTCTGG CCGGCGCCCA CGTCGACGCC GCTCCGGGCG CCAACGCGGG CTTCAGCTCG ACCGACCTCG GCGTCGGTGC CCTGGTGCTC GACAGCGCCG CCAACGCCAA GGCCGCGACG AACGCGATCG ACACCGCGAT CAAGGGCGTG TCCACCGCGC GTGCGAAGCT CGGTGCGATG CAGAACCGGT TCGAGCACAC CATCAACAAC CTGAGCGTGA CCCAGGAGAA CCTCTCGGCC TCCGAGAGCC GGATCCGGGA CACCGACATG GCTGCCGAGA TGATGCAGTT CACGCGGAGC CAGATCCTGT CGCAGGCCGG CACCGCGATG CTCGCCCAGG CGAACCAGGC CCCGCAGGGC GTGCTGCAGC TGCTGCGGTG A
|
Protein sequence | MSLRINQNID ALNSYRNLSV TQGQMSKSLE KLSSGYRINR AADDAAGLAI SEGLRSQVGG IKVAVRNAQD GVSVVQTAEG ALTETHSILQ RMRDLSVQAA NDSNDVSSRA AIQSENDALV DELDRIATST TFNNVSLLDG NYIGKNFQVG YGTGATDSIA VDITSTGTGG AKSTWANGAA STTAAAATFT HNGVATTTGV LVASTDANNI ATQLNADANF KTNYTASVDD KGGLVVTSKD GLPGAITVSG AGLAGAHVDA APGANAGFSS TDLGVGALVL DSAANAKAAT NAIDTAIKGV STARAKLGAM QNRFEHTINN LSVTQENLSA SESRIRDTDM AAEMMQFTRS QILSQAGTAM LAQANQAPQG VLQLLR