Gene Information |
Locus tag | BBta_2780 |
Symbol | |
ID | 5150934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 2872325 |
End bp | 2874190 |
Gene Length | 1866 bp |
Protein Length | 621 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640557665 |
Product | hypothetical protein |
Protein accession | YP_001238819 |
Protein GI | 148254234 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2015] Alkyl sulfatase and related hydrolases |
TIGRFAM ID | |
|
|
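The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. This is only an illustration under stated assumptions: the function names `gene_length` and `protein_length` are hypothetical, coordinates are taken to be 1-based and inclusive, and the CDS is assumed to be unspliced and to end in a single stop codon (the record lists "Is gene spliced | No").

```python
# Values copied from the record above.
START_BP = 2872325
END_BP = 2874190


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the terminal stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1866, matching "Gene Length"
print(protein_length(length_bp))  # 621, matching "Protein Length"
```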
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.37616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCGACG CGTTGCCTTT TGCGGACACG GCTGATTTCG CGGACGCGGC GCGCGGCTTT CTCGGCACCA TCGAGGATGC CAAGGTCACG ACGGCGCAGG GGCGGACCGT CTGGAGCCTT GCGCCCTACA GCTTTCTGGA TGCAGAGGAC GCGCCGCCGA CCGTCAATCC GAGCCTGTGG CGGCAGGCGA AGCTCAACAT GCATCATGGC CTGTTCGAGG TCGTGCCCGG CGTCTACCAG GTGCGTGGGC TCGATATCGC CAACATGACG CTGATCGAGG GCGAGACGGG GGTGATCGTG GTCGACACGC TGACCTCGAT CGAGGGCGCG CGGGCCGCGC TCGAGCTGTA TTATCAGCAT CGCGGGGTCC GGCCGGTCAC GGCTGTGATG TTCACCCACA CCCACACCGA TCATTGGGGC GGCGCGCGCG GCGTGGTCGA CGAGGATGCG GTCGCCAGTG GCCGGGTGCC GCTGATCGCG CCGAACCTGT TCATCGAGCA TGCGGTGTCT GAGAACATCA TCGCCGGTCC CGCGATGCTG CGCCGGGCGC AATATCAGTT CGGTCCGTTG CTGGCCAAGG GCGCGAAGGG GCATGTCGAT TGCGGGCTCG GCAAGAGCAT GGCGGCGGGA TCCGTGGCGC TGCTGCGGCC GACCGACCTC ATCATGGCGA CGGGTGACAC CAGGACCATC GACGGGCTCG TCTTCGAATT CCAGATGGCG CCGAACAGCG AGGCGCCGGC CGAGATGCAT TTCTACGTGC CGCGCTACAG GCTGTTGAAC CTCGCCGAGA ACTGTACGCA CAACTTCCAC AATCTGCTGC CGTTTCGCGG CGCCGATGTG CGCGATGCGC TGGCCTGGTC GAAATATCTC GGCGAGGCGC TGCAATTGTG GGGCGGCAAG GCCGAGGCGA TGTGCGGCCA GCATCATTGG CCGGTATGGG GTGCGGGCCG GGTCGACACC GTCATCCGCG AGCAGCGCGA TCTCTACAAA TTCGCGCATG ATCAGACCGT GCGGCTGATG AATCACGGCC TGACCGCGAG CGAGATCGCC GAGACCATCG CGCTGCCGAA AAGCCTCGAA GGCGCGTGGC ATGCGCGCGG CTATTACGGC CACATCCGGC ACAATGTGAA GGCGATCTAT CAGAAATATC TCGGCTGGTA TGATGCCAAC CCCGCCAACC TCGATCCTTT GCCGCCGGTC GAGGCCGGGC GGAAATATGT CGAATACATG GGCGGCGCGG CAAGCCTGCT GGCGCGGGCG CGCGAGGATT TTGCAAGAGG CGAGTTCCGC TTCGTGGCGC AGGCGGTCAG TCATCTCGTC TTTGCCGAGC CCGACAATGC CGCGGCGCGC GCGCTGCTGG CCGATACGCT GGAGCAGCTC GGCTACGCGG CCGAAAGCGC AACCTGGCGC AATGCCTATT TGTTCGGTGC GCAGGAGCTG CGCCACGGCA TGCCGGACGT CCCGGCGCGC CCGGGCATGC CGCGCGAGAC CCTGGCGGCC TTGCGCACGG AGCAATTATG GGACGTGCTC GGCGTCCGGC TGAACGGGCC GAAGGCCGAG GGCAAGCACA TCGTGCTGAA CTGGACCTTC ACCGATACCG GCGAGCGCTT CGTGCTGACC CTGCAGAACT GCGCGCTGAC CTATGCGGTG GGCGTGCAGG CCTCTACGGC GGACGCGGGA TTTACGCTGG CGCGCTCCAC ATTCGACGAC ATCATCGCGA AAGCTGTGAC CTTTCCGGAC GCGGTCGCGG CCGGAAAGAT CAGCTTCGCT GGCAATCCGA TGCGGCTTGC CGAGCTGATG 
TCCTTGATGG ACGAGTTCCC GCGCATGTTT GAGATCGTCG CGCCGAAACG GACGAAGGTG ACTTAG
|
Protein sequence | MLDALPFADT ADFADAARGF LGTIEDAKVT TAQGRTVWSL APYSFLDAED APPTVNPSLW RQAKLNMHHG LFEVVPGVYQ VRGLDIANMT LIEGETGVIV VDTLTSIEGA RAALELYYQH RGVRPVTAVM FTHTHTDHWG GARGVVDEDA VASGRVPLIA PNLFIEHAVS ENIIAGPAML RRAQYQFGPL LAKGAKGHVD CGLGKSMAAG SVALLRPTDL IMATGDTRTI DGLVFEFQMA PNSEAPAEMH FYVPRYRLLN LAENCTHNFH NLLPFRGADV RDALAWSKYL GEALQLWGGK AEAMCGQHHW PVWGAGRVDT VIREQRDLYK FAHDQTVRLM NHGLTASEIA ETIALPKSLE GAWHARGYYG HIRHNVKAIY QKYLGWYDAN PANLDPLPPV EAGRKYVEYM GGAASLLARA REDFARGEFR FVAQAVSHLV FAEPDNAAAR ALLADTLEQL GYAAESATWR NAYLFGAQEL RHGMPDVPAR PGMPRETLAA LRTEQLWDVL GVRLNGPKAE GKHIVLNWTF TDTGERFVLT LQNCALTYAV GVQASTADAG FTLARSTFDD IIAKAVTFPD AVAAGKISFA GNPMRLAELM SLMDEFPRMF EIVAPKRTKV T
|
| |
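The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own method; the function name `translate` and the table-building code are hypothetical. It assumes the amino acid assignments of translation table 11, which match the standard genetic code, and it does not handle alternative start codons such as GTG or TTG, which table 11 permits and which would be read as methionine at the start position. This gene begins with ATG, so that case does not arise here.

```python
from itertools import product

# Amino acid assignments in TCAG order; table 11 shares these with the
# standard code ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTCGACG CGTTGCCTTT TGCGGACACG"))  # MLDALPFADT
# Last four codons of the gene sequence, including the TAG stop.
print(translate("AAGGTGACTTAG"))  # KVT
```

Both outputs agree with the start (MLDALPFADT) and the end (KV T) of the protein sequence listed in the record.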