Gene BBta_1003 details

Gene Information

Locus tag: BBta_1003
Symbol:
ID: 5149731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 1040901
End bp: 1042151
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 65%
IMG OID: 640555997
Product: hypothetical protein
Protein accession: YP_001237165
Protein GI: 148252580
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
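
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon does not encode a residue


length = gene_length(1040901, 1042151)
print(length)                  # 1251
print(protein_length(length))  # 416
```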


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.911969
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.550503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAGC ATGGGGACCG AGATCCTGGC AGATATGGAC GCATGAGCTT GCGATTCACA 
ATCGTTCAAT ATGCCGGAGA TTATCGGGAG GCGTTCGAGC GCCTGTCGGC GGGCGGCAAG
GAAACCTATT ACGCGCAGCG TCATTCCGTG GATTTCGTCG GATCTCTGGC CAGGAGGCTT
GAGCAGGTCG CGGTGATTTG CGCCGTCAGC GACACCGCTT ACGATGCGGT CCTGGCCAAT
GGCGTTCGTG CCGTCGGCGC GGGCTTGCGT CCGGGCTTCG ATCCGGCCGC GCTGCTGCCA
TTCGTCGCCA GGACCGAGCC GAACCGCCTG TCGATCAATT CGCCCCTGGC GCCGGTGCTG
CGATGGGCCA GGCGGAACCG GATCCGAACG ATCGTGCCGC TGGCAGATTC CTTCAACGCG
GGCGGTCTGC GCGCTGCCAT CCGCCATCGC CTGCTGGCCC GTCAGCTCAA TGATCCGCTG
ATCGAGTGGG TCGGCAATCA TGGCATCAGC GCCTGCCTGT CGCTGGCGGG CATCGGTGTG
CGGGCCGACA AGATCGTGCC GTGGGACTGG CCCCCGGCAC ATCGGCCGAC CGATTATCCG
TCGCGCAACC TGACGGGTGA CGGTCCCCGC AAGGTGTTCT ATGTCGGCAG CCTGTCGCAG
GCGAAGGGCG TCGGCGATCT CCTGGCTGCT ACGGCCAGAC TTCGCGGCCA GGGCTATCCG
GTGTCGCTGA CGCTGGCCGG GCGCGATGCC GACGGCAGCA TGGCCGCGCG GGCCCGCGCA
TTGGCGATCG AGGACGCCGT CACCTTCCTG GGCGTCGTCG CCAATGAGGA CGTTCCGCAG
CTGATGCGGG AGGCCGATCT CGTCGTCATA CCGTCGCGGC ACGAATATCC GGAAGGACTG
CCGCTGACGA TCTACGAAGC GCTGTCCGCC CGCACGCCGA TCGTCGCTTC GGATCATCCG
ATGTTTCGCA ACGCGCTGAC CGACGGCGAG AGTGCGGTGA TCTTCCGGGC AGGAGACGTG
AACCAATTGG CCGCGGCGAT TGTCAAAGTC TTGGACGATC CCGCGCTCTA CCAAGCGCTC
TCGGCAAGCT CGGAGGACGC GTGGAACCGA ATTCAACTGC CGGTCACCAT GGGTGCTTTT
GTCGCAGCCT GGCTGGAGGA TACAGTTCCT GCGCGACAGT GGCTCTCAAG TCACAGCTTG
AACTCGGGGC GCTATGGTGC CGCGATCGAG AAAGCTGTGC CGCGGAGTTG A
 
Protein sequence
MIEHGDRDPG RYGRMSLRFT IVQYAGDYRE AFERLSAGGK ETYYAQRHSV DFVGSLARRL 
EQVAVICAVS DTAYDAVLAN GVRAVGAGLR PGFDPAALLP FVARTEPNRL SINSPLAPVL
RWARRNRIRT IVPLADSFNA GGLRAAIRHR LLARQLNDPL IEWVGNHGIS ACLSLAGIGV
RADKIVPWDW PPAHRPTDYP SRNLTGDGPR KVFYVGSLSQ AKGVGDLLAA TARLRGQGYP
VSLTLAGRDA DGSMAARARA LAIEDAVTFL GVVANEDVPQ LMREADLVVI PSRHEYPEGL
PLTIYEALSA RTPIVASDHP MFRNALTDGE SAVIFRAGDV NQLAAAIVKV LDDPALYQAL
SASSEDAWNR IQLPVTMGAF VAAWLEDTVP ARQWLSSHSL NSGRYGAAIE KAVPRS
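
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any length or composition check. The sketch below shows one way to do that and to compute a GC fraction comparable to the "GC content" field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence as printed in this record.
first_line = "ATGATCGAGC ATGGGGACCG AGATCCTGGC AGATATGGAC GCATGAGCTT GCGATTCACA"
seq = clean_sequence(first_line)
print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

Pasting the full gene sequence into `clean_sequence` should give a length that can be compared with the "Gene Length" field, and `gc_fraction` of the result can be compared with the "GC content" field.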