Gene BBta_6601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_6601 
Symbol 
ID5154954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp6877963 
End bp6879996 
Gene Length2034 bp 
Protein Length677 aa 
Translation table11 
GC content64% 
IMG OID640561296 
Productputative phage terminase large subunit 
Protein accessionYP_001242410 
Protein GI148257825 
COG category[R] General function prediction only 
COG ID[COG5525] Bacteriophage tail assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACGTT TCAAACACGC GGCCCTCGCG CTCTATGCGC GCGAGGTCGC GGCGCTTGCG 
CGCCCGCCGC GCAAGGTGCT GCCGGCCGAG TGGGCGGCGC AAAACCTGAT CGTGCCCGAC
GGCCCGCGAG CCAACACGCT TTGGGACCCT ACGCTTACGC CGTACGTCGT CGAGCCGTTG
AACAATTCCG GCCCGGACTC GCCGGTCAAC AAAGAGGCGA TCAAAAAGAG CGCTCAAACC
GGGTTCACGG TTATGGCGAT CGCTGTCGTC GGCTCGTCGA TCGATACCGA TCCCGCCGGC
GGTATCTTGC TTGTGCAGCC AACCGACGGC GCGCTCGCCG ACTTCATCGC CGACAAACTC
AATCCGGCGA TCGAGCAATC GAAGGCGCTG AAAGCGCGGG TCAAGCCGCA AGTGTCGCGC
TCGGGCGAAG GCTCGACGAC GTATCTCAAG CGCTACCCCG GCGGATCGAT GGCGCTCGCG
ATCGCCAACT CGACGGCGGA TTTGCGCTCA AAGACCAAGC GAAAGATCAT CAAAGACGAG
GCGAGCGAAT ATCCGGCCGA TCTCGACGGG CAGGGATCGC CGCACGCGAT GATCGAGGCG
CGTTACGAGT CGTTCCTCGC CACCGGCGAC TGGAAAGAGA TCAACATCTC GACGCCGACG
GTGAAAGGCG CTTGCTACAT CGACGAGCAA TTCAACGCCG GCGATCAGCG CTATTGGCAT
GTGAAATGCC CGCAATGTGA CGAAAAATTC GCGTTCCGAT GGGGGCCGAA TTTCAAATTC
AACGAGCAAT TCCCGTACGC GGCGCACTAC ATCGCGCCGT GTTGCGGCTA TCCGGTGCAG
GCGCACGAGA AAAACGATCT CGTGCGCAAG GGCGAGTGGA TCGCGACGGC GCCGGCACCG
GGCAAATTCC CGTCGTATCA TATCGACGCG ATGTCCTCGC CGTTCGTGCC ATGGGACAAG
ATCGCCGAGC GATGGATCGC CGCGCAGTCC GATCCCGGAA AGCTGAAAGC GTTCTACAAC
CTGACGCTCG GCGAGGCGTA CGAAATGAAG GGCGACGCGC CCGATCATGT TCGCCTCTTG
GAGCGGCGCG AGGATTACGT GCGCGGGCGC ATCCCGCCGC GCGGCCTGAT GCTCACGGCC
GCCGCCGACG TGCAGATGCG CGGCATCTAT GTCGAGGTTG TTGCATGGGC GCCGAACCGC
GAGTCGTGGG TGGTGTTCAC CGACGTTCTG GAAGGCGACA CCACCGACGC CAACGCGGGC
GCGTTCCTGA AACTCGCCGA GATTTATGAT CGCGAGTGGC CGGATGCGTT CGGCGGCAAG
CGTCGCGTCG ATGGCTTCGC CGTGGACTCG GGCTTTCGCT CGCATGTCGT CTATCATTGG
TGCGCCTCGC GTCATCTCGC GTATGCGGTT GACGGCCGCG ACGGTTGGCA TTTGCCGGCG
ATCGGCACGC CGAGCGTCAA AGACATCGAC TTGGACGGCC GCAAGCTTGG CTTTGCGAAG
CTTTGGCCCG TCGGCACATG GTCGCTCAAG GGCCATTGGT ACGAGGATTT GAGGCGCGAG
GGCAAGGCGG CCGGGCACGA GGTTGATCCG CCGGGTTACT GCCATTTCGG CAAATGGCTC
GACGAAATTT ACTTCAAGCA GGTTACGGCC GAATACCTGG CTGACATCAA ATCGCGCGGG
CGGGTTTCGA AGGGTTGGCG CTTGCGCGGC AACCAAGACA ACCATTTCCT CGACTGCAGG
ATCTACAACA TGGCGATCGC CGATCACCTC GGCCTTTCGC GCATGACGGC AGACGAGTGG
AAAATTCTAG CTCGCGATCG AGCGCCGGCG ATCAAGCAAG GCGACTTGTT CGCCCCGCCG
CCGCTCGCTG TTCAAGTTGC GTCGTCGAGT CCCGCGTCCG CAACACCGGC GCCCGTCGAG
GCGCCGGCCG ATCCGCCGCC GGCCGATCCG CCGGCGGCGA TCGCGCCCGA CGAGCCGCAA
GGCTCGGGTT GGCTTGGCCG CGACACGAGC GGATGGCTCG GCGGCGGCTG GTGA
 
Protein sequence
MLRFKHAALA LYAREVAALA RPPRKVLPAE WAAQNLIVPD GPRANTLWDP TLTPYVVEPL 
NNSGPDSPVN KEAIKKSAQT GFTVMAIAVV GSSIDTDPAG GILLVQPTDG ALADFIADKL
NPAIEQSKAL KARVKPQVSR SGEGSTTYLK RYPGGSMALA IANSTADLRS KTKRKIIKDE
ASEYPADLDG QGSPHAMIEA RYESFLATGD WKEINISTPT VKGACYIDEQ FNAGDQRYWH
VKCPQCDEKF AFRWGPNFKF NEQFPYAAHY IAPCCGYPVQ AHEKNDLVRK GEWIATAPAP
GKFPSYHIDA MSSPFVPWDK IAERWIAAQS DPGKLKAFYN LTLGEAYEMK GDAPDHVRLL
ERREDYVRGR IPPRGLMLTA AADVQMRGIY VEVVAWAPNR ESWVVFTDVL EGDTTDANAG
AFLKLAEIYD REWPDAFGGK RRVDGFAVDS GFRSHVVYHW CASRHLAYAV DGRDGWHLPA
IGTPSVKDID LDGRKLGFAK LWPVGTWSLK GHWYEDLRRE GKAAGHEVDP PGYCHFGKWL
DEIYFKQVTA EYLADIKSRG RVSKGWRLRG NQDNHFLDCR IYNMAIADHL GLSRMTADEW
KILARDRAPA IKQGDLFAPP PLAVQVASSS PASATPAPVE APADPPPADP PAAIAPDEPQ
GSGWLGRDTS GWLGGGW