Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_5527 |
Symbol | |
ID | 5150512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 5741512 |
End bp | 5743083 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640560271 |
Product | putative flagellin protein, C-terminus |
Protein accession | YP_001241393 |
Protein GI | 148256808 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGTA TTGTTCTTTC GGCTTCGGTT CGCCAAAACC TGCTGGCCTT GCAAAATACG GCATCGCTGC TGGCCACGAC ACAAAACAAC CTGGCCACCG GCAACAAGGT CAACACTGCG CTCGACAATC CCACCGAATT TTTCACTGCG CAATCACTCA ACAATCGTGC GAGCGACATC GCGAACCTGC TCGATAGCAT CGGCAACGGT GTGCAGGTTC TGCAAGCCGC GAATACCGGC CTGACCTCGC TCCAGAAGCT GGTCGACAGC GCCAAGTCGA TCGCGAGCCA GGTCCTCCAG GCCCCCACGG GCTACACCAC AAAGTCGAGC ATCACGTCAG CTGTCATCCC CGGGGCGACG GCGAACAACC TGCTCGGCAG CTCGTCGAAC AATTTCGTGA CCGGCAGCAC GGTCAACAAT GACAATCTGA GCTCCGCCGT CGCGATCACC GGTTCGACGC GACTGTCCGG TACCCCGAGC TCGACCTCGA ACGACCTGGC CTCCAGCATC ACGACCGGCG ACACCCTCGT GGTCAACGGC GTGGTGTTCA CCTTCGTCGC CGGGTCCGTG TCGGCCGGCA CCAATATCGG CGTCGGCGAC ACCGTCAGCA ACCTTCTCGC CGCCATCGAC TCGGTGACCG GCGCGACGGC CACCCCCTCC AGCGTCACCG GCGGCAAGAT CGCGCTGGCG ACAGGCACGG CGCAGGATCT GACGGTCTCG GGCACGGCCT TGGCCAAGCT CGGATTGACG GCCGCAACGA CGACACGCAA TGCACCGGCA TTGTCAGGAC AGACACTGAC AATAGCCTCG ACAGGAGGCG GTGTCGCGAC GAACATCACC TTCGGCACCG GTGCGTCGCA GATATCGACC CTGGCGCAGC TCAACACCGC GCTTGCCTCG AACAATCTGC AGGCCAGCCT GAGCACAACC GGTCAGCTCA CGATCCTGAC GACCAACGAG GCGGCCTCGT CGACGATCGG CGCGGTCGCC GGTTCGTCGA CCGCTTCCAG CATGGCCTTC AACGGCGTGA CTGCATCGAC CCCGGTGGCC GACACCAACT CGCAGACCAC GCGAGCCGGC TTGATCGCGC AGTACAACAA CGTGCTCGCG CAGATCAATA CGACCGCGCA GGACGCTTCG TTCAACGGCA TCAACCTGCT CAACGGCGAT ACGTTGAAAC TGGTCTTCAA CGAGACCGGC CGCTCGACAC TGAACATCAC CGGCGTGACG TTCAACAGCA CCGGCCTCGG CCTGTCGGCG CTGGTGGTCG GCACCGACTT CCTCGACAGC AATTCGGCCA ACAAGGTCCT GAGTACCTTG AACAGCGCTT CGACCGCGAT CCGCTCGGAA GCGTCGTCGC TCGGTTCGAA CCTCTCGATC GTGCAGATCC GTCAGGACTT CAACAAGAAC CTGATCAACG TGCTGCAGAC TGGCTCGTCG AATCTGACCT TGGCCGACAC CAACGAGGAA GCGGCCAACA GCCAGGCGCT GTCGACCCGC CAATCGATCG CGGTGTCCGC GCTGGCGCTG GCCAATCAGT CGCAGGCAAG CGTGCTGCAG CTGCTGCGCT GA
|
Protein sequence | MSGIVLSASV RQNLLALQNT ASLLATTQNN LATGNKVNTA LDNPTEFFTA QSLNNRASDI ANLLDSIGNG VQVLQAANTG LTSLQKLVDS AKSIASQVLQ APTGYTTKSS ITSAVIPGAT ANNLLGSSSN NFVTGSTVNN DNLSSAVAIT GSTRLSGTPS STSNDLASSI TTGDTLVVNG VVFTFVAGSV SAGTNIGVGD TVSNLLAAID SVTGATATPS SVTGGKIALA TGTAQDLTVS GTALAKLGLT AATTTRNAPA LSGQTLTIAS TGGGVATNIT FGTGASQIST LAQLNTALAS NNLQASLSTT GQLTILTTNE AASSTIGAVA GSSTASSMAF NGVTASTPVA DTNSQTTRAG LIAQYNNVLA QINTTAQDAS FNGINLLNGD TLKLVFNETG RSTLNITGVT FNSTGLGLSA LVVGTDFLDS NSANKVLSTL NSASTAIRSE ASSLGSNLSI VQIRQDFNKN LINVLQTGSS NLTLADTNEE AANSQALSTR QSIAVSALAL ANQSQASVLQ LLR
|
| |