Gene BBta_0016 details

Gene Information

Locus tag: BBta_0016
Symbol:
ID: 5152372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 18198
End bp: 19568
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 67%
IMG OID: 640555048
Product: hypothetical protein
Protein accession: YP_001236227
Protein GI: 148251642
COG category: [R] General function prediction only
COG ID: [COG2079] Uncharacterized protein involved in propionate catabolism
TIGRFAM ID:
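
The coordinate and length fields above are mutually consistent, and a minimal sketch of that arithmetic may make the relationship explicit. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are illustrative and not part of the database record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(18198, 19568)
print(length)                     # 1371
print(protein_length_aa(length))  # 456
```

Both results agree with the Gene Length and Protein Length fields of the record.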


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCAGG AAACCGCAAC GCTCGCCGCC TATGTCGCCG ACCTGAAATA TGACGATATC 
CCGCCCGAGG TACTCGACCG CGCCAAGGTG CTGACCCTGG ACTTTCTCGG CAGCGCCATC
CGTGCCCGGC GCGAGGCGGA ATCCACCCCA TCCGTGGTGA AGATGCTGGC GGCACTCAAG
CTGGATTCGG CCGGCGAGGC GACCGTGTTC GGCGATTCCA GGACGTGGAC GCCGGCGGTG
GCGGCGCTGC TGAACGGCAC GATGGGCCAT TCGCTCGATT TCGACGATAC CCACGCCGAC
TCGTCGCTGC ATCCCAGCGC CCCGGTGGTG CCGGCCGCCT TCGCCGTCGG CGAAATGGTC
GGCGCGTCGG GCCGCGAAGT GCTGACCGCG ATCGTCGCCG GCTATGAAGT GTGCTGCCGG
CTCGGCAATG CGCTCGATCC AACCTCGCAT TATGCCCGCG GGTTTCACCC GACTGCGACG
GCAGGCACCT ATGGCGCAGC CGCGGCCGCC GGCAAGCTGT TCGGCCTGTC CGAGCAGCAG
CTGATCTACG CGTTCGGCGT GTCCGGCAGC CAGGCGGCCG GCTCGCTGCA ATTCCTGGTC
AACGGCGCCT GGAACAAGCG CTACCAGGTC GGGGCGTCGG CGATGAACGG CGTGATCGCG
GCGACCTTGG CCAAGAACGA TTTCGTCGGC GCGATCGAAT CCGTCGAGGG CAAGCACGGC
CTGCTGGTCG GCTACACCGA CACGCCGCAC CGGGAGAAAG CGGTCGCCGG CCTCGGCACG
ACCTATGAGA CGATGAAGAT CGGCGTGAAG CCGTATCCGA GCTGCCGCTA TACGCATGCC
GCGATCGACG CCATCATCGC ACTGCGCCGC GAGCACAATC TGACCCCGGA CCAGGTCAAG
CGCGTCGAGA TCGGTCTGCA CCGCAACGGC ATCACCTTGA CCGGCGATGC CGCGACCAAG
CGCCACCCGA CCTCGATCGT CGGCGGCCAG TTCTCGATGT TCTTCACCGG CGCGCTGGCG
CTCGACCAGG GCCGCTTCGG CTGGGACGAC TACACGCGGC TCGGCGATGC TGCGATCAAC
AACCTCGCGA ACAAGTTCGA CGTCGTGCAG GACGACAAAT TGGAGATCGG CCGCAGCCAT
CCGTTCGGCG CGCGCGTCAC CATCACCACC GAGGATGGCG TGCACGAGCG GCTGCATGAC
GATCCGTCGG GCGAGCCGAC CTCGTTCCCC TCGGCGCAGG CCATGAGCGA CAAGTTCATC
ACCTTGGCGC GGCCGGTGCT GAGCAGCCGG GCGCAGCATT TTGCCGACGC GATCCTGGGG
CTGGAGCGGT TCGATCGTGT GGCGCAGGCC ACCGCGTTGG GGAGAGCATA G
 
Protein sequence
MAQETATLAA YVADLKYDDI PPEVLDRAKV LTLDFLGSAI RARREAESTP SVVKMLAALK 
LDSAGEATVF GDSRTWTPAV AALLNGTMGH SLDFDDTHAD SSLHPSAPVV PAAFAVGEMV
GASGREVLTA IVAGYEVCCR LGNALDPTSH YARGFHPTAT AGTYGAAAAA GKLFGLSEQQ
LIYAFGVSGS QAAGSLQFLV NGAWNKRYQV GASAMNGVIA ATLAKNDFVG AIESVEGKHG
LLVGYTDTPH REKAVAGLGT TYETMKIGVK PYPSCRYTHA AIDAIIALRR EHNLTPDQVK
RVEIGLHRNG ITLTGDAATK RHPTSIVGGQ FSMFFTGALA LDQGRFGWDD YTRLGDAAIN
NLANKFDVVQ DDKLEIGRSH PFGARVTITT EDGVHERLHD DPSGEPTSFP SAQAMSDKFI
TLARPVLSSR AQHFADAILG LERFDRVAQA TALGRA
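
The gene and protein sequences above can be cross-checked with a short script. The following is a minimal sketch, assuming the sequences are pasted in with their block spacing intact. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_content` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, keeping '*' for stops."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First seven and last four codons of the gene sequence shown above.
print(translate("ATGGCCCAGG AAACCGCAAC G"))  # MAQETAT
print(translate("GGGAGAGCATAG"))             # GRA*
```

Passing the full gene sequence to `translate` and stripping the trailing `*` should reproduce the listed protein sequence, and `gc_content` on the same input can be compared against the GC content field.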