Gene BBta_4604 details


Gene Information

Locus tag: BBta_4604
Symbol:
ID: 5149412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 4823369
End bp: 4824589
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 66%
IMG OID: 640559404
Product: hypothetical protein
Protein accession: YP_001240538
Protein GI: 148255953
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5653] Protein involved in cellulose biosynthesis (CelD)
TIGRFAM ID:
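
The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon not counted in the protein length. The constant and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 4823369
END_BP = 4824589
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1221
print(expected_protein_length(GENE_LENGTH_BP))  # 406
```

Both results agree with the Gene Length and Protein Length fields of the record.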


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATGG CCGCCGTGAT CGAAAACCGG ACGGCGCAGT CGCCGCTGTC TGGACTTGCA 
CAGGTCGAGA TCGTCTCTGA TCTCGCCGCG GCCGAACCGG CCTGGCGCAT TCTCGAGGCA
CCCGACCACA TCTCGACGCC CTACCAGCGC TTCGACCTGC TCGCCGCGTG GCAGCACGAG
GTCGGCGCCC GCGAGCAGGC GACGCCCTTC ATCGTCATCG CTCGCGATGC CAAGCAGCAG
CCGTTGCTGC TGCTGCCGCT GGCGCTGACG CACGCGTTCG GGGCGCGGGT CGCGAGCTTC
ATGGGCGGCA AGCACACCAC CTTCAACATG CCGCTGATGC ACCGCGCGTT TGCGGCGCGT
GCCAGCGTTG GCGATCTCGA ATTCCTGCTT GCCGGACTGC GTGATCATGG CGGCGTGGAC
GTGCTGGCGT TGACGCAACA GCCGCTGCGC TGGAGCACGA TCGCCAACCC GCTGGCGCAA
TGGCCGCGCC AGCCATCCGT GAACGACTGC CCGGTGCTGT TGATGCCCCC CGGCGCCGCA
TCGACCGCTT TGTTGTCGAA CTCATTCCGC AAGCGGCTCA AGAGCAAGGA GAAGAAGCTG
CAGGCGCTGC CCGGCTATCG CTACATGATC GCCAGCAGCG ATGCCGAGAT CGCCGAGCTG
CTGGACTGGT TCTTCCGGAT CAAGCCGATC CGCATGGCCG AGCAGAAGCT GCCCAACGTC
TTCGCCGAAC CCGGCATCGA AGCCTTCGTG CGCGCCGCAT GCCTGGCCAA GCTCAGCTGC
GGTCATCGCG CCATCGAGAT TCATGCGCTG CGCTGCGACG ACGAGATCAT CGCGCTGTTC
GCCGGCGTCG CCGATGGCGA GCGCTTCTCG ATGATGTTCA ACACCTATAC GCTGTCGGAG
AATGCCCGCT GGAGTCCGGG ACTGATCCTG ATGCGCTCGA TCATCGATCA TTATGCGCAA
AGCGGGTTCC GCGCGCTCGA TCTCGGCATT GGCTCCGACG ACTACAAGCG GATGTTCTGC
AAGGATGACG AGCCGATCTT CGACAGCTAT CTGTCCTTGA CTGCGCGCGG GCTCGTGGCG
GCGCGGACGA TGGCCGCGCT TGGCCGCGCC AAGCATGCCG TCAAGCACAG CCCTGCTCTG
TTTCGTCTGG CGCAGCGCGT CCGCGGCGCG CTGCAGTCTG GCGGCAGCGC CGCCAGAGCC
GAGGAACGGG CCGACGATTA G
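
The gene sequence is printed in blocks of 10 bases, so spaces and line breaks have to be removed before any computation such as the GC content reported in the record. The Python sketch below shows one way to do this. The function names are hypothetical, and the example uses only the first two blocks of the sequence, so its result is not the 66% reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First two 10-base blocks of the gene sequence, used as a small example.
first_blocks = clean_sequence("ATGACCATGG CCGCCGTGAT")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 60.0
```

Passing the full 1221 bp sequence through `clean_sequence` and `gc_percent` would give the gene-wide value to compare against the GC content field.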
 
Protein sequence
MTMAAVIENR TAQSPLSGLA QVEIVSDLAA AEPAWRILEA PDHISTPYQR FDLLAAWQHE 
VGAREQATPF IVIARDAKQQ PLLLLPLALT HAFGARVASF MGGKHTTFNM PLMHRAFAAR
ASVGDLEFLL AGLRDHGGVD VLALTQQPLR WSTIANPLAQ WPRQPSVNDC PVLLMPPGAA
STALLSNSFR KRLKSKEKKL QALPGYRYMI ASSDAEIAEL LDWFFRIKPI RMAEQKLPNV
FAEPGIEAFV RAACLAKLSC GHRAIEIHAL RCDDEIIALF AGVADGERFS MMFNTYTLSE
NARWSPGLIL MRSIIDHYAQ SGFRALDLGI GSDDYKRMFC KDDEPIFDSY LSLTARGLVA
ARTMAALGRA KHAVKHSPAL FRLAQRVRGA LQSGGSAARA EERADD
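
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The Python sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, not part of the record.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; the standard code and
# NCBI translation table 11 assign the same amino acid to each codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last three blocks of the gene sequence.
print(translate("ATGACCATGG CCGCCGTGAT CGAAAACCGG"))  # MTMAAVIENR
print(translate("GAGGAACGGG CCGACGATTA G"))            # EERADD
```

The two outputs match the first 10 and last 6 residues of the protein sequence, and the final codon TAG is the stop codon.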