Gene BBta_2139 details


Gene Information

Locus tag: BBta_2139
Symbol:
ID: 5155307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 2217902
End bp: 2219092
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 63%
IMG OID: 640557076
Product: putative L-arabinose transport system permease protein araH
Protein accession: YP_001238232
Protein GI: 148253647
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4214] ABC-type xylose transport system, permease component
TIGRFAM ID:
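
The length fields in this record follow from the coordinates. A minimal Python sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon (both are assumptions, though they agree with the values listed here); the function names are hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2217902, 2219092)
print(length)                  # 1191
print(protein_length(length))  # 396
```

The calculation does not depend on the strand, which this record leaves blank.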


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGACA AGACGGTTTC GCTCCCGGAA GAGAAGTCGC ACGCCGGCTT CCTGAAGAAT 
AATCTGCGCA GCTACGGCAT GCTGATGTCG TTGTTCGTCA TCATGCTGTT CTTCCAGTTC
ATGACCGACG GCACTCTGCT GCAGCCGCTC AACCTCACCA ACCTGGTGCT GCAGAACAGC
TACATCGTCA TCATGGCGCT CGGCATGCTG CTGGTCATCG TGACCGGGCA TATCGACCTC
TCGGTCGGTT CGGTGGCGGG CTTCGTCGGC GCGGTCGCGG CCGTGCTGAT GGTGCGCTAT
CACCTCGACT ATCCGCTCGC GATCATCGCC TGCCTGATCG TCGGCGCCGC GATCGGCGCG
GCCCAAGGCT ATTGGGTGGC GTATTTCGGC ATTCCCTCCT TCATCGTCAC GCTTGCCGGC
ATGCTCGTGT TCAAGGGCTT GGCGCTGGCG CTGCTGCAGG GACAGTCGGT CGGTCCGTTC
CCGGCGACGT TCCAGAAACT GTCGTCGGGC TTCATTCCGG AGCTCATTCC AAGCTCCGGC
AATCTCAATC TGACCTCGCT CGCCATCGGC GCGGTGCTGA CGCTGGTGCT GGTCTATGCC
AGCGTGAAGG GACGCGCGCG TGAGGTCGAG CATGGCATCG AGGTCGAGCC GTTCGGCTTC
TTCGCCGCCA AGAACGTCGT TCTGGCCGGT GCGCTGATGT ATTTCACCTA TCTGATCGCC
TCGCATCGCG GCCTGCCGAA CGTACTCGTG ATCATGACCG CGCTGATCGC GCTCTACGGC
TTCATGACGC GGCGCACCGT CGTCGGCCGG CAGATCTACG CCGTCGGCGG CAATGCCAAG
GCGGCGAAGC TGTCGGGCAT CAAGACCGAG CGCCTGGTTT TCATGACCTT CGTCAACATG
GGCGTGCTGG CGGCGCTGGC CGGCCTGATC TTCGCCGCGC GGCTCAACAC CGCGACGCCG
AAGGCGGGGC TCGGCTTCGA GCTCGACGTC ATCGCGGCCT GCTTCATCGG TGGCGCCTCC
GCCTATGGCG GTGTCGGACG GGTCGGCGGC GCGGTCGTGG GTGCCATGAT CATGGGCGTG
ATGAACAACG GCATGTCGAT CCTTGGCATC GGCATCGACT ACCAGCAGGT GATCAAGGGG
CTCGTGCTGC TCGGCGCCGT CTGCATCGAC GTCTACAACC AGCGTCGATA A
 
Protein sequence
MTDKTVSLPE EKSHAGFLKN NLRSYGMLMS LFVIMLFFQF MTDGTLLQPL NLTNLVLQNS 
YIVIMALGML LVIVTGHIDL SVGSVAGFVG AVAAVLMVRY HLDYPLAIIA CLIVGAAIGA
AQGYWVAYFG IPSFIVTLAG MLVFKGLALA LLQGQSVGPF PATFQKLSSG FIPELIPSSG
NLNLTSLAIG AVLTLVLVYA SVKGRAREVE HGIEVEPFGF FAAKNVVLAG ALMYFTYLIA
SHRGLPNVLV IMTALIALYG FMTRRTVVGR QIYAVGGNAK AAKLSGIKTE RLVFMTFVNM
GVLAALAGLI FAARLNTATP KAGLGFELDV IAACFIGGAS AYGGVGRVGG AVVGAMIMGV
MNNGMSILGI GIDYQQVIKG LVLLGAVCID VYNQRR
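
The protein sequence can be reproduced from the gene sequence by codon translation, and the GC content by counting bases. The sketch below is an illustration only, not the database's own pipeline. It assumes that the amino-acid assignments of translation table 11 match the standard code (the two differ only in the permitted start codons), and `clean`, `translate` and `gc_percent` are hypothetical helper names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated with T, C, A, G at each position).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence 5'->3', stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 12 bases of the gene sequence listed above.
print(translate("ATGACCGACA AGACGGTTTC GCTCCCGGAA"))  # MTDKTVSLPE
print(translate("CAGCGTCGATAA"))                      # QRR
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared against the protein sequence and the GC content field of this record.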