Gene BBta_4420 details


Gene Information

Locus tag: BBta_4420
Symbol:
ID: 5153793
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4631615
End bp: 4632643
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 67%
IMG OID: 640559228
Product: putative sugar hydrolase/Beta-N-acetylhexosaminidase
Protein accession: YP_001240365
Protein GI: 148255780
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
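
The coordinates and lengths in this record agree with one another, and the short sketch below checks this. The variable names are illustrative and not part of the record. The check assumes that the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record.
start_bp = 4631615
end_bp = 4632643
gene_length_bp = 1029
protein_length_aa = 342

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # span matches the stated gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein length
```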


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.821767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATGC GGGCTTTCAT TACCGGCATC TCCGGCCCTG ATCTCACCGA GGCCGAGCGC 
GCGTTTATCC GCGCGGCGAA GCCCTGGGGC TTCATCCTGT TCAAACGCAA TGTCCAGTCA
CCTGCGCAAG TGACTGCACT CGTTGAACAA TTGCGTGCTT GCGCGGGTCG GGCTGAGGCC
CCCGTTTTGA TCGACCAAGA GGGCGGGCGG GTCCAGCGGC TGGGGCCGCC GCATTGGCCG
GTCTATCCCG CTGGTGTCGT CTTCGACCGC CTCTACGACC TTGATTCGTC CCTCGGCCCG
CGTGCCGCCT GGCTCAGCGC CCGCCTGATC GCCGACGACC TGCAGCAACT CGGCATCACC
GTGGATTGCC TGCCGCTGGC CGATGTCCCG GTTGCCGGCG CGGACGCGGT GATCGGCGAT
CGGGCCTATG GAACGACGCC GGCCAAGGTG GCGGCGATCG CGCGGGCGGT GACGGATGGG
CTGGAGCAGG GCGGCGTGCT GCCGGTGCTC AAGCACATTC CCGGTCACGG CCGGGCCACC
GCCGACACGC ATTTCCGGCT GCCGACCGTT GACACCCCGG AAACCGAGCT CGACGCCACC
GATTTCGCTG CCTTCCGGCC GCTCGCGGAT CTGCCGATGG CGATGACTGC ACATGTTGTG
TTTAGCGCGA TCGATGCCGC CCATCCGGCC ACGACTTCTG CGACAATGAT CCAGCGGGTG
ATTCGCGAGC GGATCGGGTT CCAGGGTTTG TTGATGAGTG ATGACGTTTC CATGAACGCT
CTGGCCGGAT CGATCGCCGA GCGCACGCGC GCGATCGTCG CGGCGGGGTG CGACATGGTT
CTGCATTGCA ACGGCAAGCT CGACGAGATG CAGGCCGTCG CCGCCGAGAC GCCAGAGCTG
GCTGGCCAGG CTTTGCTCCG CGCCGATCGC GCGCTTGCGG CGCGCAAGAC CCCCTCGGGC
TTTGACCGGA TCGCCGCGCG CGCCGAGCTC GACGCCCTGA TCAACCGGCT GGGACCCGCG
AGCGCATGA
 
Protein sequence
MTMRAFITGI SGPDLTEAER AFIRAAKPWG FILFKRNVQS PAQVTALVEQ LRACAGRAEA 
PVLIDQEGGR VQRLGPPHWP VYPAGVVFDR LYDLDSSLGP RAAWLSARLI ADDLQQLGIT
VDCLPLADVP VAGADAVIGD RAYGTTPAKV AAIARAVTDG LEQGGVLPVL KHIPGHGRAT
ADTHFRLPTV DTPETELDAT DFAAFRPLAD LPMAMTAHVV FSAIDAAHPA TTSATMIQRV
IRERIGFQGL LMSDDVSMNA LAGSIAERTR AIVAAGCDMV LHCNGKLDEM QAVAAETPEL
AGQALLRADR ALAARKTPSG FDRIAARAEL DALINRLGPA SA
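
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The sketch below is a minimal, dependency-free illustration of how the gene sequence could be cleaned, its GC fraction computed, and its codons translated. The function names are hypothetical and are not taken from the record or from any library. The codon-to-amino-acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). Translation stops at the first stop codon.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases (expects a non-empty sequence)."""
    seq = clean_sequence(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGACCATGC GGGCTTTCAT TACCGGCATC TCCGGCCCTG ATCTCACCGA GGCCGAGCGC"
)
print(translate(first_line))  # MTMRAFITGISGPDLTEAER
```

The printed peptide matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` in the same way gives values to compare against the GC content and protein sequence listed in the record.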