Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Taci_0666 |
Symbol | |
ID | 8630480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermanaerovibrio acidaminovorans DSM 6589 |
Kingdom | Bacteria |
Replicon accession | NC_013522 |
Strand | + |
Start bp | 701162 |
End bp | 704188 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | |
Product | flagellin domain protein |
Protein accession | YP_003317184 |
Protein GI | 269792280 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTCAGC GGGTTACTAA CAGCATGATG CACGGCATGT TGCTGTCCGA CATGCAGAGC AACTTGGCCA AGATGCTGGA GATCCAGAAG CAGCTGGCCA CCCAGAAGAA GTTCTCCAGG CCATCGGACA ACCCCATAGA CGTGACCAGG TCTCTCTCTA TGGACACCAC CATAACGGAG AACGTCCAGT ACAGGAGAAA CCTGGACGAC GCTCTGACGT GGCTCAACAA CACCGAGACC GCATTTGACC AGATAACCAG CGTCTATCAG CAGGTCCGTC AGTTGGCCGT ATACGCTGGG GACGGGGGGC TGGTGGACGT GGACATGGGG GCCATCGCGG AGCAACTGTA CCAGCTGCAG GAGGAGATGC GGAACGCGGC CAACTACGAG GTGGAGGGCC GTTTCCTCCT GTCAGGTCTC TCCACCGGTG TCAGGCCCTT CGTTAGGGAC TCCAGCGGCC GGGTGGTCTA CAAAGGCAGC ACCATGCCGG TCTACTTCGA GATGGAGCGC CAGCAGCTGG GCCGGGTCTC CTTTACCGGC CGGGACGTTT TCCAGGTGAA CGAGAAGAAG TACACGTTGA GGAGCTTCGA GGTGCCCTTG GACTTCACCT GGAAGGGGCG GGACGAGATA ATCCAGCTCC AGGTGGGAGA TCAGGTGGTG AAGGCCCGTC TGTCGGAGAG GTGGCAGGAC GAGAAGCTGG ACAACGTGGC GGACAGCACC GACTACGACC GTTTCCGGGA GGCTTCGGAG CTGGAGGGTT ACAGTCTGGA CGACGTAGCC AAGGCGCTGA ACGACAGCAT AGAGATGGGG GATCTCAAGC GGCTGGTGTC CGTATCGGTG GAGAAGGACA CCGCCAAGGG GGTCCAGCGG CTGGTGATAA GGAGCCACAA CGGCCTTCCG GTTAGGCTCA CCAGCTGGCC CGAGACTGAC ATACCAAAGC TCTCCCAGGG TGTTCGGGGT GTCGATGTTC CCCTGGCCCC CCCGTTTGTG GCGGACGCCT CTTCCACCCT TACGGTGGAC TTCGGCAATG GGGTGACCCA CGACGTGGCG GTGGCGGGCA AGACCTTGTC CCAGATTGCC GATGAGCTCA TGAAGGTCCC TGGTTTATGG GCGGAGAGAA AGACCGATGG GACCAACGAG TGGCTCCTGG TGGTATCCCG GGATCCGTCC AGGTCCTTCA GCATAAGGTC CACCGGCAAC GTGGCCACCG ATGTCTTTGG ATCCGACACG GTGTTGTCCT CCGAGGTCCA GAAGAACGTG GATCACAGCC ACATAGACCT GGTGCGGCTT CTCGGCATGG AGACCTCCCT GAAGTCCACC GAGGTTTCCC CCACTTGGAA CGTGGACACC ACCGCATCTC CCCTTCACTG GAAGTTCATG GCGGGGGGGA AGAGGGGGGA GCTCTTCATA AACGGTGACC CGGACCTAAC CATGGAGGAG CTGGCCCAGC GGATAAATGC GGTGATGGGT GAGTGGGTCG AGGCGGTGGT GGAGCTGGAC GAGCCGGACG GGGCCAGCCC TAGCCCGGAC CCTTTGGGCA ACAGCGGTTC CAACGCCGAG GAGGCCACCA AGAGGCTTAT CCTGAGGACC CGGGACGGCT CCCCCCTTGT GGTATACGAT GGGGAGAGGG CGGGGGCCAA CTACGCCTCC CAGCTGGGGG TGGATACCAG CGTCAGGGGG GCGGGGCCTT TGACCTATCC GAGCGACGGG GCTGGGCCTT TCGACGAGAA CATGCCCGCC CTGGTGGAGG TCCGGGTTGG GGACGAGACC TACACGGTGA AGCTCTGTCG GTTGAGGCAC GACACGGGGG AGAAGGTGGC GGAGGCCATC GTCGCCCAGG TGAACCAGAT GGCGGGGGAG AGGCTTTTGG ACGTGGACAG TCTGTCCTCC TCCGATGACT TCGCCATCTT GTCCCTCACG GGACAGCCGG TTAGCATCGT GGACCGGGGC TATGGGGATC CCGCCTACGG CCAGTATACC GGTGGCGTGG CCATCCAGCT GGGCATCGCT TCCGGTGTGA CCGGGGGCGG CGTGCCGGGC AACACCGCTG CCGGGGCCGA CGGGACGGTC AGGATCTCCT CCCTTGGGCG GTGGGTTGAC GTGCCGGTCC TGGCCACCGA CGACGTGAAG AGCTTCCTGG ACCGGGCCAG GGACCTGGCG GGGGATTGGT TGGATCTGTC GTACTTCGAT CCGAGCCTGT CCAACCCTCC GGGGGGCACT AACGTCAGTT TCTCCATATC CGCCAAGGAC GGTTCCCCGG TATCCATCTT CGATCTGTCC GGTTCCGCCT CCAGCGTTTT CAACATGGGC ACCGGTCTTT CGGGCAGTTC GCTTGGGGCT TGGACTCCGG TGGCGGGTGA CGTGCTCACC ATATCGGTAA ACGGGGTGAC CCACAGCATA GATTTGTACG ATGAGAGCAA GGGGCAACCC ATCGTATCCA ACTTGGATGA GCTGGCGGAT CTGATAAACG CCCGCTTTCA GGGGCAGGAT CTCGTGGCTC AGACGGTGGA CATGGGGGGG GGGAGCAAGA GGTTGGTGAT AACCTCCCCG AGGGGCTACG TGGTCAATGT TGACGAAAGC GCCATGTCGG CTGGGACCCA GTTGGGGCTC AATGGGACGT CTCCCTCAAG AGGGGGGTAC GGCCCTTTCA ATCAGAGGGT TCAGGTGCGA ACTCTGGGCA ATCAAACCAA ACAGGACTTT TTCGGTGTCA TGGATGACCT GATAAACGCG GTCCGTAACG AGGATCGCCG GGGTATATCG GACCATCTCC TAAAGAAAGT GACCGATTGG GGGGATAACC TGCTCCGATG TAGGACCGAG TGCGGGGCCT TGATAAACCG CTATGAGAAC ACTCAGGCTA GGCTCAAACA GAACAATGTG AATTTGACAG AACTCCAAAG CAAGATATCC GATGTGGATC TGGCGGAGGC GGCTACGCAG TTCCAGATGG CTCAAGCGGT GTACCAGGCC AGTTTGGCGG TCATAGCCAG GATAATCCAG CCGACCCTTG TGGATTTTTT GAGGTGA
|
Protein sequence | MLQRVTNSMM HGMLLSDMQS NLAKMLEIQK QLATQKKFSR PSDNPIDVTR SLSMDTTITE NVQYRRNLDD ALTWLNNTET AFDQITSVYQ QVRQLAVYAG DGGLVDVDMG AIAEQLYQLQ EEMRNAANYE VEGRFLLSGL STGVRPFVRD SSGRVVYKGS TMPVYFEMER QQLGRVSFTG RDVFQVNEKK YTLRSFEVPL DFTWKGRDEI IQLQVGDQVV KARLSERWQD EKLDNVADST DYDRFREASE LEGYSLDDVA KALNDSIEMG DLKRLVSVSV EKDTAKGVQR LVIRSHNGLP VRLTSWPETD IPKLSQGVRG VDVPLAPPFV ADASSTLTVD FGNGVTHDVA VAGKTLSQIA DELMKVPGLW AERKTDGTNE WLLVVSRDPS RSFSIRSTGN VATDVFGSDT VLSSEVQKNV DHSHIDLVRL LGMETSLKST EVSPTWNVDT TASPLHWKFM AGGKRGELFI NGDPDLTMEE LAQRINAVMG EWVEAVVELD EPDGASPSPD PLGNSGSNAE EATKRLILRT RDGSPLVVYD GERAGANYAS QLGVDTSVRG AGPLTYPSDG AGPFDENMPA LVEVRVGDET YTVKLCRLRH DTGEKVAEAI VAQVNQMAGE RLLDVDSLSS SDDFAILSLT GQPVSIVDRG YGDPAYGQYT GGVAIQLGIA SGVTGGGVPG NTAAGADGTV RISSLGRWVD VPVLATDDVK SFLDRARDLA GDWLDLSYFD PSLSNPPGGT NVSFSISAKD GSPVSIFDLS GSASSVFNMG TGLSGSSLGA WTPVAGDVLT ISVNGVTHSI DLYDESKGQP IVSNLDELAD LINARFQGQD LVAQTVDMGG GSKRLVITSP RGYVVNVDES AMSAGTQLGL NGTSPSRGGY GPFNQRVQVR TLGNQTKQDF FGVMDDLINA VRNEDRRGIS DHLLKKVTDW GDNLLRCRTE CGALINRYEN TQARLKQNNV NLTELQSKIS DVDLAEAATQ FQMAQAVYQA SLAVIARIIQ PTLVDFLR
|
| |