Gene Information |
Locus tag | BBta_3998 |
Symbol | |
ID | 5152061 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4205766 |
End bp | 4206485 |
Gene Length | 720 bp |
Protein Length | 239 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640558831 |
Product | putative (S)-2-haloacid dehalogenase |
Protein accession | YP_001239972 |
Protein GI | 148255387 |
COG category | [R] General function prediction only |
COG ID | [COG1011] Predicted hydrolase (HAD superfamily) |
TIGRFAM ID | [TIGR01428] 2-haloalkanoic acid dehalogenase, type II; [TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase; [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
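The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the reported protein length excludes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4205766
end_bp = 4206485

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# One codon (3 bp) per residue; the final codon is the stop codon
# and is assumed not to be counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 720
print(protein_length)  # 239
```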
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.666733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGATC TCTCGATGGT CAAGGCGCTG GTGTTCGACG TGTTCGGCAC CGTCGTCGAC TGGCGCACCA GCCTGATCAA CGACTTCACC GCCTGGTCGA AGTCCCGCGG CATCCAGGGC GACTGGACCG CGCTGGTCGA CGGCTGGCGC GGGCTCTATG TCGGCTCGAT GGACGAGGTC CGCAAGCATC CTGAGCGCGG CTACGTCATC CTCGACGTGC TGCATCGCCG CTCGCTGGAG ACGCTCGTCG CCCAGCTCGG CATCAGCGGC CTCACCGAGG CCGATCTCGA TCATCTCACC CGCGGCTGGC ACCGGTTGCA TCCGTGGGCC GATAGCGTCG CTGGGCTGAC CCGGCTGAAA CGCAAATACA TCATCGCGCC GCTCTCCAAC GGCAATGTCG CGCTGCTCAC CAACATGGCG AAGTTCGCCG GTCTCCCCTG GGACCTGGTG CTCTCCGCCG AGCTGTTTCA GCACTACAAG CCCGACCCCG AGACCTATCT CGGCGCCGCG CGTCTGCTCG GTCTTGCGCC CGGCGAGGTC ATGATGGTCG CCGCGCACAA CAATGATCTC GAAGCGGCCC AGCGCTACGG CCTCAAGACC GCCTTCGTCG CGCGGCCCAC CGAATACGGC CCGCTGCAGA GCCGCGACTT CGAGGCCACC GGCGCCTGGG ACATCGTCGC CGCCGACTTT GGCGGCATCG CCGACCGCCT AGGCTGCTGA
|
Protein sequence | MSDLSMVKAL VFDVFGTVVD WRTSLINDFT AWSKSRGIQG DWTALVDGWR GLYVGSMDEV RKHPERGYVI LDVLHRRSLE TLVAQLGISG LTEADLDHLT RGWHRLHPWA DSVAGLTRLK RKYIIAPLSN GNVALLTNMA KFAGLPWDLV LSAELFQHYK PDPETYLGAA RLLGLAPGEV MMVAAHNNDL EAAQRYGLKT AFVARPTEYG PLQSRDFEAT GAWDIVAADF GGIADRLGC
|
| |
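The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short sketch. It uses plain Python rather than a bioinformatics library, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may act as starts, so the standard codon-to-residue mapping is used here. The example runs on the first 30 bases and the last 12 bases of the gene sequence above, not on the full record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (last base varies fastest).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
head = "ATGTCCGATC TCTCGATGGT CAAGGCGCTG"
# Last four codons of the gene sequence, including the TGA stop.
tail = "CTAGGCTGCTGA"

print(translate(head))  # MSDLSMVKAL
print(translate(tail))  # LGC
print(round(gc_fraction(head), 3))
```

Passing the full Gene sequence value to `translate` and `gc_fraction` is the natural way to compare against the Protein sequence and GC content fields.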