Gene Information |
Locus tag | Saro_2361 |
Symbol | |
ID | 3915706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2508639 |
End bp | 2509622 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640445116 |
Product | flagellin-like |
Protein accession | YP_497631 |
Protein GI | 87200374 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
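The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon. Neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2508639
END_BP = 2509622
GENE_LENGTH_BP = 984
PROTEIN_LENGTH_AA = 327


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


# Both checks agree with the values listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                 # True
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)    # True
```

Under those assumptions, the 984 bp span corresponds to 328 codons, the last of which is the stop codon, giving the 327 aa listed.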
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCAGTCA TCAACACCAA TATCAGCGCG ATCCGCGCGA CGAATGCTTC GAACTCGGCG AACAAGATGC TCGGCACGGC CATGGAGCGC CTGTCGACCG GCAAGCGCAT CAACGGCGCG AAGGACGATG CAGCGGGCCT CGCGATCACC ACGACCATGA CCTCGCAGAT CCGCGGCATG AACCAGGGTG TCCGCAATGC CAACGATGGC ATCAGCCTTG CGCAGACCGC GGACGGCGCA CTCAACGAAG TGACCGCCAT GCTGCAGCGC ATCCGCGAAC TTGCCGTTCA GGCGAAGTCG GGCACCTATC AGCAGTCCGA CCGGGACGCG ATGCAGTCGG AAGTCGCAAA CCTCACGCAG CAGATCAGCG ACGTCTTCAA CAACGTCAAG TTCAACGGAA ATCAGGTCTT CAGCGTGTCC GACGGCACTG GCGGCACTGG CGACCCGACT GACTACTCCG AAGACAACGT ATCGATCGAC GATGCCGAAT TCGTGATCCA GACCGGCGCC GAGATCGACA ACACCGTCAC CCTCGTCAGC AAGGCCTTCG ATGGCGCGAA GCTGTTCGGC GTGATTTCGG ATGGCGGCGC CAACGATGGC CTGGCCATGA CGGTCTACGA CGACTCCACC TATGACGACG ACAGCGATAC AACGACTCCC GAGGTCCCCG TCGTGACGCA GGCCCTCGAC GTCTCGACCA GTGCCAATGC CTCGACGACT ATCGAAAACG TCGATTCGGT GCTGGCCAGC ATCAATTCCA CCCGTGCCGG GCTGGGTGCG GGCCAGAACC GCCTGGAATC GGTGATCAAC AATCTCAACG ACAACGTCAC CAATCTGTCC GACGCCCGGT CGCGGATCAT GGACACCGAC TATTCGGCCG AAACGACCGC GATGGCCAAG GCCCAGATTC TCAGCCAGGC TTCGACGGCG ATGATCGCCC AGGCCAATCA GGCCCAGCAG AACGTGCTTT CGCTTCTGAA GTAA
Protein sequence | MAVINTNISA IRATNASNSA NKMLGTAMER LSTGKRINGA KDDAAGLAIT TTMTSQIRGM NQGVRNANDG ISLAQTADGA LNEVTAMLQR IRELAVQAKS GTYQQSDRDA MQSEVANLTQ QISDVFNNVK FNGNQVFSVS DGTGGTGDPT DYSEDNVSID DAEFVIQTGA EIDNTVTLVS KAFDGAKLFG VISDGGANDG LAMTVYDDST YDDDSDTTTP EVPVVTQALD VSTSANASTT IENVDSVLAS INSTRAGLGA GQNRLESVIN NLNDNVTNLS DARSRIMDTD YSAETTAMAK AQILSQASTA MIAQANQAQQ NVLSLLK
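The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the method used to produce this record. It builds the codon table from the amino acid assignments of NCBI translation table 11, which match the standard code for amino acids and differ only in the permitted start codons. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The sequence is assumed to be given 5' to 3' on the coding strand, as shown above for this minus-strand gene.

```python
# Codon table built in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCAGTCA TCAACACCAA TATCAGCGCG"))  # MAVINTNISA
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field, and passing it to `translate` gives a string that can be compared with the protein sequence listed above.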