Gene BBta_4520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4520 
Symbol 
ID5151436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4734064 
End bp4735410 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content66% 
IMG OID640559321 
Producthypothetical protein 
Protein accessionYP_001240458 
Protein GI148255873 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGACGCG GCTTTGTGAG GACTGTGTCG GCGCTGCTCT GGCGGGGCGT GATCGTCGCC 
GCTCTCGGCG CGATCGGCAT CACCGGGGTG CCGTCAGGCG CAAGTGCGCA GTCGCTCGCT
GGCGCGGACA CCGTCTATGC CTCGGCTCCG CCGCGACTGC TGCAGATCCG CACCCTCCTG
GCCGATGCCG GCCGCCAGAC CTCGACCGGC TCCGGCTTTC TGGTCTCGGC CGACGGGCTT
GCGATCACCA ACTACCACGT GGTCTCCGAC GCGGCGCTCG AGCCGAAGAC CTATCGGCTC
GAATATACCG GAGCAGATGG CACGCAGGGC GGCGTGACTC TGCTTGCGGT CGATCTGCCC
AATGATCTCG CACTCGTGCG CGTCGACAAG CACGACGCGC CGTTCTTCAC CTTCGACAAA
GCGGCGCTCG AGGGCAGCCT GCCCAAGGGC GAGCGTCTCT ATTCGCTCGG CAACCCGCTG
GACCTCGGCT TTACCATCAT CGAAGGGACC TATAACGGCC TCGTCGAGCA CAGCTACAAC
GACCATATTC ACTTCACCGG CGCGCTCAAT CCCGGCATGA GCGGCGGTCC CGCCGTGAAC
GCCCAAGGGC AGGTGGTCGG CGTCAATGTC GCAACGCGAC GCGGCGGTCA GCTGATCAGC
TTTCTGGTGC CCGCCCGCTT CGCCGCCGCT CTGCTGATCC GCGGCAAGGA CATCAGGCCG
GAGGCCGCGG ATCTGCGCAA GGACGTCGTT GCCCAGCTCG CCAGCTGGCG CGGCGCGCTG
TACAAATCCC TGGCCGAGGA AGGCTTCCAT GATCGCGTGT TCGGCTCCTA TCAGGCGCCG
GAAACGCATG CGGCGTGGTT CGAATGCTGG GCCAGCACCA ATGCCAGCGC CTCGCCGAAG
CCGCGGGCCA GCATCAATTC GACCAGTTGC AAGGCCGATG CGAGCGTCTA TGTCGCCTCC
GACCTCAACA CCGGTACGGT CGAAATCAAT CATTCCTACG CGAAGTCGAT CGACCTCAAT
CAGTTTCAAT TCGCCACCGT GCTGACGCAG CTGGCGCAGC CGCGGCTGAC CTTGGGCGGC
ACGTTCCGCA AATGGTACAC GCCGCAGCAC TGCCATGAGG ATTTCGTCGG CATCGCGCCG
CCGGCCGATC ACCCACCGCT GCGCGTGCTC TGGTGCGCGC AGGGCTATCG CGAGTTCGAC
GGCCTCTATG ACGTCGCGGT TGTCGCGGTC ACGCAGGACC GTGCCGACGA GGCGCTCGTC
TCCCGCCTGA ATCTGCAGGC GATCGCCTAT GACGACGCGT TGCGGCTCGG CAGGAGCTTT
CTCGAACGGC TGCAGGTCGC CCGATGA
 
Protein sequence
MGRGFVRTVS ALLWRGVIVA ALGAIGITGV PSGASAQSLA GADTVYASAP PRLLQIRTLL 
ADAGRQTSTG SGFLVSADGL AITNYHVVSD AALEPKTYRL EYTGADGTQG GVTLLAVDLP
NDLALVRVDK HDAPFFTFDK AALEGSLPKG ERLYSLGNPL DLGFTIIEGT YNGLVEHSYN
DHIHFTGALN PGMSGGPAVN AQGQVVGVNV ATRRGGQLIS FLVPARFAAA LLIRGKDIRP
EAADLRKDVV AQLASWRGAL YKSLAEEGFH DRVFGSYQAP ETHAAWFECW ASTNASASPK
PRASINSTSC KADASVYVAS DLNTGTVEIN HSYAKSIDLN QFQFATVLTQ LAQPRLTLGG
TFRKWYTPQH CHEDFVGIAP PADHPPLRVL WCAQGYREFD GLYDVAVVAV TQDRADEALV
SRLNLQAIAY DDALRLGRSF LERLQVAR