Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_5361 |
Symbol | |
ID | 5150113 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 5588816 |
End bp | 5590606 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640560120 |
Product | hypothetical protein |
Protein accession | YP_001241244 |
Protein GI | 148256659 |
COG category | [S] Function unknown |
COG ID | [COG3025] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.319316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00226049 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCGGATT CGAACGCCGC GTCGATCCAA GGCAGCGATT GCGTTATGCC TGGATCAAGC GTACAAGATG CGACGAGCAG CAACGCAGAC TGCACGGCGG ACCCTCCAAG TGGGATTGCA CCTTCCCGTC GGATCGACAC TGCTGGCCAG CCGGTCTCTT TGGCTGTGGA TTCGGGGGCT GCGGATCCGG CAGCTGCAGA TGCGGGCGCG GCGCCTCTCG ATCCCGCGCG ACCCGCATTG CCCGATCCCG CAATCTCCGG CGAAGAGATC GAGCTGAAGC TGCTCGTCGT CCCCGAGCAG CTCGCCGATT TCAACAATGC GCCGATCGTC GTAGCGCATG CGCGCAACAA GGGCACGCGC AAGCATCTCA CATCCGTCTA TTACGATACG CCGCGACGAC AGCTCTGGAA GAACGGCTTC ACGCTGCGGG TGCGCCAGAG CGGCTCGCGT TTCGTGCAGA CCGTCAAGTC CCAGCAGAGC GACGACCCGC TGAAGCGCGG CGAATGGGAA GCGAGCGTCA CCTCGCTCGC GCCGGACACG GCATTGGCGG CGGCACTGCT GCCCGAGGAA CTGCGCGCAG CCGTGACCGA CTCGCCGCTC GAGCCGGTGT TCACGGCAGA CGTGCATCGT CATGCGCGGC TGCTCGATCT GCCCAACGCA ACCCTCGAGG TCGCCTTTGA CAGCGGCGTG ATCAAGGCGG GTGAGCACAG CGAGACGGTC AGCGAGATTG AGCTGGAACT CAAGAGCGGC AACCCCGCGA CGATCTACGA GGCGGCGCTG CGGCTCGCCG AGCACGGTCC GGTGCGGCCC TCGATCCGCA GCAAGTCGGC ACGCGGTTTC GATCTCGCCG CAAGCGTAGC GCCAGGCGCC GAGAAACCGC AGAAGCTTCA TCTGGATCCC GCGGTCTCGC TGGACGAGGC CTTCGCGGTG ATCCTGCGCG GCAGCTTCCA TCATCTGCTG CAGGCGCTGC CGGCAGCCGA GGATGGCCGC GATCCGGAAG GCGTTCATCA GATGCGGGTG GCGCTGCGGC GGCTCCGCGC GGCGCTGCAT CTGATGCGGC CGCTCGGGGT TTCGGCGACG CTGGAGAGCC TCGAGACCGA TGCACGCTGG CTCGCACAGA ATCTGTCCAC GGCGCGCGAT CTCGACGTCT TCCTGACCGA GACATTGCCT GAGATCGCCG AAGCCTGTCC GACGGTCGCC GGCTTCGACA CGCTGCAGAG CCTGGCCGAG CGGCAGCGCG CGCTTGCTTA TCGCAAGCTG CGCATGGCGG TCGCCGACCG CCGCTGCGCC GCCTTCGTGC TCGGTCTCGG CGCGTTGATC GCGACGCGTG GCTGGCGCAA CGACGTCTCT CCAGACGAGC TTGGGCGGCT CGCGGGGCCG GCGCTCGATT TTGCCGGACA CGTGCTGGCG GAGCGGCATC AGACGGTGCT CAAGCGCGGC CGACGTTTCA AGAAGCTGCC GGCCGAGCGC CGCCATCGGG TGCGGCTGGC GCTGAAGAAG CTGCGCTACA GCATCGACTT CCTGCTGCCG CTCTACGGCG CAAGCAAGCC GGCCAGGAAA TATGCCAGGA CCCTCGCCGA CCTGCAGGAG CAGCTCGGCT ATTACAACGA CATGGCGGTG ACCGCCGGCG TCATCGCCGA TCTCGGCACG ACCTCGACCG ATGCGGCGAT CGCCGCGGCC GCAATCACCG GCTGGCACGC CCATGCGATG GCCGGCGTCG AGCAACCGCT ACGCGAGGCG TGGCGTGCAT TCGCGAAAGC GCCGACTCCC TGGGGCTCCG AGGAGGCGTA G
|
Protein sequence | MPDSNAASIQ GSDCVMPGSS VQDATSSNAD CTADPPSGIA PSRRIDTAGQ PVSLAVDSGA ADPAAADAGA APLDPARPAL PDPAISGEEI ELKLLVVPEQ LADFNNAPIV VAHARNKGTR KHLTSVYYDT PRRQLWKNGF TLRVRQSGSR FVQTVKSQQS DDPLKRGEWE ASVTSLAPDT ALAAALLPEE LRAAVTDSPL EPVFTADVHR HARLLDLPNA TLEVAFDSGV IKAGEHSETV SEIELELKSG NPATIYEAAL RLAEHGPVRP SIRSKSARGF DLAASVAPGA EKPQKLHLDP AVSLDEAFAV ILRGSFHHLL QALPAAEDGR DPEGVHQMRV ALRRLRAALH LMRPLGVSAT LESLETDARW LAQNLSTARD LDVFLTETLP EIAEACPTVA GFDTLQSLAE RQRALAYRKL RMAVADRRCA AFVLGLGALI ATRGWRNDVS PDELGRLAGP ALDFAGHVLA ERHQTVLKRG RRFKKLPAER RHRVRLALKK LRYSIDFLLP LYGASKPARK YARTLADLQE QLGYYNDMAV TAGVIADLGT TSTDAAIAAA AITGWHAHAM AGVEQPLREA WRAFAKAPTP WGSEEA
|
| |