Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_p0032 |
Symbol | |
ID | 5148678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009475 |
Strand | - |
Start bp | 22446 |
End bp | 23855 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640538968 |
Product | hypothetical protein |
Protein accession | YP_001220401 |
Protein GI | 148240900 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.555164 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAC GCCCGATGTG GAAGCCAAAA TGGAGCGAGC CATGCCCATG CGCGTCCGGG AAAAAATACA AGCAATGCTG CTGGCAGCGG CTGCAAGGCT TTGACATCGG CAAGGCGTAC GACCAGGCGA TCAAGGATAA TAACCTGGAG CGGGCGCTGA GCGCGACCCG CGCGGACATC ACGCAATATA CAATCTGGCA CAAGACCAAT ACCGCTCCCG CGTTGGGCCG GGTCAGCGTC GGGCCCGAGC TGTTGCGGAT CGACGTGAAC GCCCTCGGCG CCTACGTTTC TCGCCTGTCG TGGCTATATT TCCGGCTCGG CCTCACGAAG GATTGGCTGG CGACGCTGGA CCGGCTGCGC ACCAACATTC AGCATCCTTC GTGGTACAGG AAGATCGTCT ACTATCGGGC TATGCATTAT CTGTCGCCCG GCAGTGACCG AGAGGAAGCC CGGCGGGAGC TGGCGAAGGC GGGTCCGATC ACGAAGGCGG AAACGGACCT TGAGCTGCTG CAACTGTACG TCGACCTGGA GTTCGACGAT CAGCCGTTCG CGCCCCGAAT GGAGATTATC GACCGCATCC TCGAACTCGA CGAAGATCGC GACAATCAGC TCCAATATCG CGGAGCGAAA GCGGTTCAGT ACTTGATGAT CGGAGACACC AAGACGGCCG AGCAAATCTT GGCCGAGGTC CTCGACATCG TGAAAGACAC TGAGGCCGAC GACCCCCTTG GAGCATATGA GCGGCACCTC TTCGGACGGC TCCTGCAATT GCAGGGTAAC CTGCGACGCG ACAAGCAGCT GCTGAAGGAC TCCGCAGCGC AGTTTCATGC GCTGCTGCTT GAGGACAACT GGACGCAGGC CGGCAGGGCC GCGCTGCAGC GCGAATTGGG CGACAGCTAC CGATACGCTG ATGATTGGGA GCAGGCGGAA TGCGCTTACC GGGAAGCCCT TCAGCTCGGA TCTTCTGACC TCGACAAGGT GCACCTGGCG GAATGCCTCC TTTACCGCAA GCAGATCGAG GCCGCGAGCA AGGAAATCGA CGCCGTCAAG CGGGAAACCC TGCTGTGTCA TGAATTCGAG GATTTCGTCT TCGCCTATTC GGCCATTGCG ATCTGGTCGA CCGCGCCGGA GCGGCTGACC AAGGCGAAGA CCTTACTCCA GTCACTGGGA ACGGCCGAGC CGCTGTTCAA TGAGCGGCGA CTCAACTTGT TGCTCCGTGT GACGGAGACG CTCGCAAGCG GCAAGGCCTC GACTAAAGCG AAATCCGACA GCACGCCAGA CGGCGGGCTC GCGACGGTAT CCAACTTCTT CTTGCTGGAG CCGAACATCG CGGGCATCGG CATTAACTTC AACGCGATCA TCAATTATCT CGCGCGCAAG AAGGCCAGGA AGAAGCCCGA GACTGAGTAG
|
Protein sequence | MPKRPMWKPK WSEPCPCASG KKYKQCCWQR LQGFDIGKAY DQAIKDNNLE RALSATRADI TQYTIWHKTN TAPALGRVSV GPELLRIDVN ALGAYVSRLS WLYFRLGLTK DWLATLDRLR TNIQHPSWYR KIVYYRAMHY LSPGSDREEA RRELAKAGPI TKAETDLELL QLYVDLEFDD QPFAPRMEII DRILELDEDR DNQLQYRGAK AVQYLMIGDT KTAEQILAEV LDIVKDTEAD DPLGAYERHL FGRLLQLQGN LRRDKQLLKD SAAQFHALLL EDNWTQAGRA ALQRELGDSY RYADDWEQAE CAYREALQLG SSDLDKVHLA ECLLYRKQIE AASKEIDAVK RETLLCHEFE DFVFAYSAIA IWSTAPERLT KAKTLLQSLG TAEPLFNERR LNLLLRVTET LASGKASTKA KSDSTPDGGL ATVSNFFLLE PNIAGIGINF NAIINYLARK KARKKPETE
|
| |