Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tgr7_1981 |
Symbol | |
ID | 7316370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | - |
Start bp | 2100130 |
End bp | 2101518 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643616874 |
Product | flagellin domain-containing protein |
Protein accession | YP_002514049 |
Protein GI | 220935150 |
COG category | [N] Cell motility |
COG ID | [COG1344] Flagellin and related hook-associated proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCAGA TCATCAATAC CAACGTGCTG TCGCTCAACG CCCAGCGCAA CCTGACCAAC TCCCAGAGCG CCCTGCAGAC CTCCCTGCAG CGGCTGTCCT CGGGTCTTCG CATCAACAGC GCCAAGGACG ATGCCGCCGG TCTGGCCATC TCCGAGCGCT TCACCACCCA GATCCGCGGT CTGAACCAGG CTGTGCGCAA TGCCAACGAT GGCATCTCCT TCGCCCAGAC GGCCGAAGGC GCCCTGTCCA CCGTGGGTGA TGCCCTGCAG CGTATCCGCG AGCTGGCGGT ACAGTCCGCT AACGACACGA ACTCCTCCTC TGACCGCCAG GCCCTGAACA ATGAGGTGCA GCAGCTGATC GCGGAGGTGA ACCGGGTTGC GAATGCCACT GAGTTCAATG GTCAGAAGAT CCTGGACGGC AGCCTGTCTG AACTGGTGTT CCAGGTGGGT GCCAACCAGA ACCAGACCAT CACCGTTAAC GGTGTAGACG CCCGAGCCTC CCAGCTCGGT GCCCGTGTGG CTGAAGGTGA CTCTGCCCTG TCTGGCGATT TCACCGGCGC CTTTGATTTC GGTACGCCAG GTACTCTGGC CATTCAGGGT GTTGAGATTG ATCTTGCAGA CCTCAACACC GATGCCACTT CTTTGACCGA AGTGGTTAAT CGAATCAATG CATTGTCCGC GGAAACTGGT GTCACCGCTG CGCTGTCCTC CCAGGCCGAA GCCACGTTTG CCATCACGGC TGCGGATGCC GCGGGTACCC TGAACATCAA TGGGGTGAAC ATCGCTGTCG ATAGTGGTGA TACCGCTGAG TTGTTGGCTG GCAAGATCAA TGCACTGTCC AACCAGACCG GTGTTACCGC AAGCTTTGAC GGTGCTGATC TGACGCTGAC TTCCAACGGC GACATCACGA TTGAGCGAAC TGGTGGTGGC GCTCTGGTCA TCGGCGGCCT GGGAGCCGGT GAATCCGGTA CCATCATGCG TGGAATTGAT CTGGCCACCA ATGTGGGTCA GGACATCCTT GTGGCCACCA CTGGCAGTGC CGACGACCTT AACATTACTG CTGGTGGTAC TGATCGCGCC CTGACTGATC AGAACGTCCT GAGCCGTGAA CGTGCTAACC AGACTATCCA GGTCATGGAT TTCGCCCTAC AGCAGGTCAG TGGTCTGCGC GCCGAACTGG GTGCGGTCCA GGTGCGCTTC GAGTCCACCA TCTCCAACCT GGGCGTCTCC GTGGAGAACC TCTCCGCTGC CCGGTCCCGT ATCCGGGATG CGGACTTCGC CGCAGAGACC GCCGAGCTGA CCCGCGCCCA GATCCTGCAG CAGGCCGGTA TCTCGGTGCT GGCCCAGGCC AACGCGGTGC CCCAGAGCGT GCTGGCCCTG CTGCAGTAA
|
Protein sequence | MAQIINTNVL SLNAQRNLTN SQSALQTSLQ RLSSGLRINS AKDDAAGLAI SERFTTQIRG LNQAVRNAND GISFAQTAEG ALSTVGDALQ RIRELAVQSA NDTNSSSDRQ ALNNEVQQLI AEVNRVANAT EFNGQKILDG SLSELVFQVG ANQNQTITVN GVDARASQLG ARVAEGDSAL SGDFTGAFDF GTPGTLAIQG VEIDLADLNT DATSLTEVVN RINALSAETG VTAALSSQAE ATFAITAADA AGTLNINGVN IAVDSGDTAE LLAGKINALS NQTGVTASFD GADLTLTSNG DITIERTGGG ALVIGGLGAG ESGTIMRGID LATNVGQDIL VATTGSADDL NITAGGTDRA LTDQNVLSRE RANQTIQVMD FALQQVSGLR AELGAVQVRF ESTISNLGVS VENLSAARSR IRDADFAAET AELTRAQILQ QAGISVLAQA NAVPQSVLAL LQ
|
| |