Gene BBta_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_1004 
Symbol 
ID5149732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp1042242 
End bp1043468 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content68% 
IMG OID640555998 
Productputative glycosyl transferase, group 1 family protein 
Protein accessionYP_001237166 
Protein GI148252581 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.531302 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATCG CCTATCTGGT GAACCAATAT CCGAAGGTCA GCCACAGCTT CATTCGCCGT 
GAAATCCTCG CCCTGGAGCG CGAGGGCCTG GAAGTCACGC GAATCTCCAT TCGCGGCTGG
GACAATGACC TGGTCGATGA GGCGGATCTC GCCGAGCGCG CGAGGACACG TTACGTCCTG
CAGGAGGGCG CCGTCTCGAT CGCGTTGGCC ACGGCCTTCG CGGCGGTCAC GCGGCCGCGC
GCCTTCGCGT GGGCGTTGCT GCTCGCATTG CGCATGGCGC GGCGTGCCGA ACGGTCGCTG
CCCTATCACC TGATGTATCT GGCCGAGGCC TGCCTGATCC TGCGCTGGCT GCGCCAGGCC
GGCGTCGCGC ATGTTCATGC CCATTTCGGC ACCAACTCCG CGGAGGTGGC GATGCTGGTG
CACGCGCTTG GCGGCCCGCC TTTCAGCTTC ACGGTGCATG GGCCCGAGGA GTTCGACAAG
GCACCGCTGT TGGGGCTTGC GGCGAAGATT CGCCACGCCG CCTTTGTCGT CGCGATCAGC
TCATTTGGCC GCAGCCAATT GCTGCGCCTG GTCGAGCACG CGCATTGGGG GAAGATCCAG
GTGGTCCGCT GCGGCCTGGA GCAGACCGAC TTCGAGACGC ACTCCGACAT CGACGACAGC
CGGACCCTGG TCTGTGTGGG GCGGCTCTGC GAGCAGAAAG GACAGCTGAT CCTGATCGAG
GCCGCTCGGC GGCTGGCCGA GGCGAATGTC GACTTCACGC TGACGCTCGT GGGCGACGGC
GAGCTCCGCC AGGACATCGC CGCACTGATC GACAAGCACG GGCTTGCCGA CCGCATCCGC
ATCACCGGCT GGGCCACCGC GGGGGAGGTG CGTTCACACC TTCTGCGCGG GCGCGCGCTG
GTGCTCCCGA GCTTCGCGGA GGGGCTGCCG GTCGTGATCA TGGAGGCGAT GGCGCTGCGC
CGTCCGGTCA TCTCGACCTA TGTCGCCGGG ATTCCCGAGC TTGTCAGGGA CCAGGAGCAC
GGCTGGCTTG TTCCGGCCGG CGATGCTGAA GCGCTCGCTG CGGCGATACG CCGCTGCCTC
GACAGCGCCC CGGCAGAGCT CCAGTCCATG GGACGAGCCG CCTACGCCCG CGTCCGCGCG
CAGCACCAGA TCGAGACCTC GGCGCAGCAG CTCAAACGGC TGTTCGAAGC CGGCGCGAGC
GAAGCGCGTT CTTCCCAAAC CGGCTGA
 
Protein sequence
MRIAYLVNQY PKVSHSFIRR EILALEREGL EVTRISIRGW DNDLVDEADL AERARTRYVL 
QEGAVSIALA TAFAAVTRPR AFAWALLLAL RMARRAERSL PYHLMYLAEA CLILRWLRQA
GVAHVHAHFG TNSAEVAMLV HALGGPPFSF TVHGPEEFDK APLLGLAAKI RHAAFVVAIS
SFGRSQLLRL VEHAHWGKIQ VVRCGLEQTD FETHSDIDDS RTLVCVGRLC EQKGQLILIE
AARRLAEANV DFTLTLVGDG ELRQDIAALI DKHGLADRIR ITGWATAGEV RSHLLRGRAL
VLPSFAEGLP VVIMEAMALR RPVISTYVAG IPELVRDQEH GWLVPAGDAE ALAAAIRRCL
DSAPAELQSM GRAAYARVRA QHQIETSAQQ LKRLFEAGAS EARSSQTG