Gene Information |
Locus tag | Csal_1985 |
Symbol | |
ID | 4027069 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2237768 |
End bp | 2238838 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 637967181 |
Product | flagellin-like protein |
Protein accession | YP_574036 |
Protein GI | 92114108 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
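The coordinate and length fields above are related by simple arithmetic, which can be sketched as follows. This is a minimal check, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start/end coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


# Values taken from the record above (Csal_1985).
length = gene_length_bp(2237768, 2238838)
print(length)                     # 1071
print(protein_length_aa(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields of this record.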
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGTTA TCAATACGAA CATCACGGCC ATGATCGGCC AGCAGAACCT GAGCCAGTCA CAGAGCGCTC TGTCCACTTC CATGGAACGC CTGTCTTCCG GTCTGCGCAT CAACAGTGCC GCCGACGACG CCGCCGGTCA GGCCATCGCC AACCGCATGT CCAGCCAGAT CACCGGTCTG AGCCAGGCGC AGCGTAACGC CAACGACGGC ATCTCCGTTG CCCAGACCAC CGAAGGCGCT CTGAACCAGG TCAACGACAA CCTGCAGCGC GTGCGTGAAC TGACCGTTCA GGCTCAAAAC GGCACCAACA GCCAGGAAGA CTTGGATTCC ATCCAGGACG AGATCAACCA GCGTCTGGGT GAGATCGACC GTATCTCCGA AGAAACCAGC TTCAACGGCG TCGACGTTCT CGCTAGCGAC CAAGAAATTT CCATCCAAGT CGGTTCTGAA GATGGCCAAA CCATCACCAT GAACCTTCAG GAAGTTAATG CGGAGACTCT TGGCCTCAGT AATTTTGACG TTTCTGATCG CGCAGAGTCA GTTTCTGCAG GTGGCTATGA CTCCGGCTCA ACAGTTGCAG CTGATGACGC AACATTTAGC TTCACTGACA GCACAGGTGC TGTCAGTGAT TTCGACGGGT ACTCCGTCGT AGAAAATGAT AGCGGCACTT TCGTTCGTGA TGACCAAGGC AACTACTATG ATGCTACAAT GTCAGTAGCT TCTGCTGATA CAAGCTCAGT AGGGGTAACA TTTGACTCGG ATAGCGCCTT GACTGCTGAT GAACTATCCA CAAATGGTCT CTCTCCACTG GCTGACATTG ATGCTGCAAT TAGCAACGTT GATAGCCAAC GCTCTGAGTT GGGTGCCATG CAGAACCGCT TCGACTCGGC CATCACCAAC CTGAGCACCA CCGAGACCAA CCTCTCTTCT GCTCGCTCGC GCATCGAAGA TGCCGACTAC GCGGACGAAG TCTCCAACAT GACCCGTAAC CAGATTCTGC AGCAGGCAGG TACTTCCGTG CTGGCCCAGG CCAACCAGCT GCCGCAGAAC GCGCTGTCTC TGCTGGGCTA A
|
Protein sequence | MAVINTNITA MIGQQNLSQS QSALSTSMER LSSGLRINSA ADDAAGQAIA NRMSSQITGL SQAQRNANDG ISVAQTTEGA LNQVNDNLQR VRELTVQAQN GTNSQEDLDS IQDEINQRLG EIDRISEETS FNGVDVLASD QEISIQVGSE DGQTITMNLQ EVNAETLGLS NFDVSDRAES VSAGGYDSGS TVAADDATFS FTDSTGAVSD FDGYSVVEND SGTFVRDDQG NYYDATMSVA SADTSSVGVT FDSDSALTAD ELSTNGLSPL ADIDAAISNV DSQRSELGAM QNRFDSAITN LSTTETNLSS ARSRIEDADY ADEVSNMTRN QILQQAGTSV LAQANQLPQN ALSLLG