Gene BBta_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_2101 
Symbol 
ID5155447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp2172114 
End bp2173487 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content69% 
IMG OID640557038 
Productputative Atrazine chlorohydrolase 
Protein accessionYP_001238194 
Protein GI148253609 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.908269 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.986241 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGA CAGCGATCTT CGGCAGCTAT GTGCTGACGC GCAAGGATGG CGCGCAGGAC 
GTCCTGCGCG ATCACTGGGT GCTGGTCGAG GGCCGGCGCA TCGCGGCGAT CACGCCGAGC
CGGCCGGTGG CCGACGAGGT GTTCGACCGG CCGGGCCGCT TCGTGCTGCC GGGTCTGTTG
AACCTGCACA ACCACATCTT TAGCGAGGCG ATCGCGCGGA CCTTGACCGA GGACGGCAAT
GGCCGCCGCA ACAACAAGAG CATCATCTAT ACGGTGCTGC TGCCGCTCTC CAAGCGCGGC
GCCGAGATCC TCACGGCCGA GGAGCGGCTG GCGATCGGCC GCATGGGCGT GCTGCAGCTG
TTGAAGGGCG GCGCCACCAC GGTCATGGAG CCGTTCCGCA ACGCGATCCC CGAGTTGTTC
GACGCGGCCT GCGAAATGGG GCTGCGCTTC TACGGTGCGC CCTATCTGTT CTCGACCGGG
GATGCCAAGG CGGATGCCTC CGGCGTCGTG ACCTATGCCG GCGATGACGG CGAGGCCGAC
CTCGCGATCT GGAATGCGCT CTATCAGCGT TGGCACGGGC AGGGCGATGG CCGCGTCCGG
CTGGCGATGA GCCCGCACGC GACCGATACC TGCGGTCCCG ATCTGATGCG CGCCATCGCC
GCGCGGGCGC GCGAGCTCGA CGTTCCCATC ACCATCCACA TGGCGCAGAG CGCGGGTGAG
GTGGCGACCA TCGCACAGCG CCATGGCGGC CGCACGCCGG CCGAATATCT CGACTGGCTC
GGCCTGCTCG CTCCCGACCT GCTCGCCGCG CACTGCCACG CCTCGACCGA TGCGGATCTC
AGGCTGATGG CGGCACGCGG CGCCGGTGTG CTGAACTGCC CGCGCGTGTT TGCGCGCGCC
GGCATCACCG CCGCCTTCGG CCGCTTCGCC GCGCATGGCG TGCGTACCGC CGTCGGCACC
GACGGCTACA ACATGGACCT GCTCGGCGAG CTCAATGCGG CCTCGTTGAT TTCCAAGATC
GCGCTGGGTA GCGCCGAGAC GGCGAGCGCG CCGGAACTGA TCGACGCGGT CACTGCAACC
GCGGCCGCCA TGATCAAGCG CGACGATCTC GGCGTGATCG CCCCGGGCGC GACCGCCGAT
CTCACCATCG TCGACATGAC GCATCCGCAT CTTCAGCCAT TGCACGATCC CCGTCGCGGT
CTGATCGCGC TCGCCAACCG TGCCAATATC GACCAGGTGA TGGTCGACGG CCGGCTGCTG
ATTCACGACG GCCACTATCT CCACGGCGAC GAGGCCGCGA TCACGGCGGC GGGGGCGACG
GCGATCGCGA AGATCTGGGC GCTGCCGGAG GCGCAGGCGG CGTTTGCGGG CTGA
 
Protein sequence
MSTTAIFGSY VLTRKDGAQD VLRDHWVLVE GRRIAAITPS RPVADEVFDR PGRFVLPGLL 
NLHNHIFSEA IARTLTEDGN GRRNNKSIIY TVLLPLSKRG AEILTAEERL AIGRMGVLQL
LKGGATTVME PFRNAIPELF DAACEMGLRF YGAPYLFSTG DAKADASGVV TYAGDDGEAD
LAIWNALYQR WHGQGDGRVR LAMSPHATDT CGPDLMRAIA ARARELDVPI TIHMAQSAGE
VATIAQRHGG RTPAEYLDWL GLLAPDLLAA HCHASTDADL RLMAARGAGV LNCPRVFARA
GITAAFGRFA AHGVRTAVGT DGYNMDLLGE LNAASLISKI ALGSAETASA PELIDAVTAT
AAAMIKRDDL GVIAPGATAD LTIVDMTHPH LQPLHDPRRG LIALANRANI DQVMVDGRLL
IHDGHYLHGD EAAITAAGAT AIAKIWALPE AQAAFAG