Gene BBta_1042 details


Gene Information

Locus tag: BBta_1042
Symbol:
ID: 5150716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 1085092
End bp: 1086219
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 64%
IMG OID: 640556035
Product: putative glycosyl transferase group 1
Protein accession: YP_001237203
Protein GI: 148252618
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
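
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1085092, 1086219)
print(length, protein_length(length))  # prints: 1128 375
```

Both values match the Gene Length and Protein Length fields listed in the record.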


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.508131
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGCTT GTTTTGCGGA TTTGAGACCG ACGATGCGCG GCCGACGACT GTTCATTAAT 
GGAAAATATC TCGGCGCCGG CCCTACCGGC GTGCACCGGG TCGCTGACGA GCTCATTGCT
CAGCTCGCGA GCCGGCGCGA CGAGCTTGCT GGACTGTTCG CGGAGCCCAC CACCATCGTG
GTGCCGAGGA CCGTGGCAAA GTCCCGTCAC CAGTTCGAGA TGAAGCAGGC GGGTTTCCTC
ACAGGCCAGC TGTGGGAGCA ACTCGACCTC CCGAGGGCCG CGCGGTCGGG GCTGTTGCTC
AGCCTGTGCA ATCTCGCGCC GATGATCAGC ACGTCCGCGG TCACCATGAT CCATGATGCG
CAGACCTTCT CGACGCCGGA ATCATATTCG AGCGCGTTTG CCCGCTGGTA TCGCTACGTC
CTGCCGACCA TCGGCAGGAG ACATCGCAAG ATACTCGCCG TCTCCCACTT CACCGCGAAA
CAACTGGTCG ACTTCGGGGT GACGACGGCC GACCGGATCG CGGTGGTTCC CAACGGGGTC
GATCATGTCC TGCGGGTTCC GTCCGAGCCC GGCATCTTCG ATCGGCTCCA GCTTGGCGAG
ACGAAATTCG TCGTCGCGTT GGCCAATACG CAGCGGCACA AGAATGTCGG TCTGTTGCTG
AAGGCGTTCG CGGATGAACG CCTGAAGCCG ATCCGGCTGG TTCTTGTCGG CGGCGAGGGG
GCCCAGGCCT TCGCGGCGCT GGGCCATGTC GTTCCCGACA ATGTCCGGTT TGCCGGCAGG
GTCAGCGATG GCGAGCTCCG CGCACTGTTG GAACGGGCCG TCTGCGTCGC CTTTCCCTCG
ACCACCGAAG GCTTTGGCCT CCCGCCATTG GAAGGAATGC TGCTTGGCTG TCCCGCGGTC
GTGGCGCCCT GCGGGGCTTT GCCGGAAGTG TGCGGCGACT CCGCGATCTA CGCGTCGGCT
GATGATCCGC GAGAATGGGT CGAGGCGATC TCGTCGTTGG CCAACGATCC ACTATGCTGG
AACCGCTATT CCCAGAACGG GCGCGAGAGG GCGAACATAT TCACCTGGCG CGCGGCCGGG
GACAGGCTGA TGGACGTGCT GAGATCCATC CAGGGAACCC AGATCTAG
 
Protein sequence
MEACFADLRP TMRGRRLFIN GKYLGAGPTG VHRVADELIA QLASRRDELA GLFAEPTTIV 
VPRTVAKSRH QFEMKQAGFL TGQLWEQLDL PRAARSGLLL SLCNLAPMIS TSAVTMIHDA
QTFSTPESYS SAFARWYRYV LPTIGRRHRK ILAVSHFTAK QLVDFGVTTA DRIAVVPNGV
DHVLRVPSEP GIFDRLQLGE TKFVVALANT QRHKNVGLLL KAFADERLKP IRLVLVGGEG
AQAFAALGHV VPDNVRFAGR VSDGELRALL ERAVCVAFPS TTEGFGLPPL EGMLLGCPAV
VAPCGALPEV CGDSAIYASA DDPREWVEAI SSLANDPLCW NRYSQNGRER ANIFTWRAAG
DRLMDVLRSI QGTQI
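
The relationship between the gene sequence, the GC content field, and the protein sequence can be illustrated with a minimal Python sketch. It is not the portal's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is built by hand instead of being taken from a library. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation; because this gene starts with ATG, the sketch ignores alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 shares; '*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGAAGCTT GTTTTGCGGA TTTGAGACCG"
print(translate(first_blocks))  # prints: MEACFADLRP
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be compared against the record in the same way.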