Gene BamMC406_5092 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBamMC406_5092 
Symbol 
ID6181515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia ambifaria MC40-6 
KingdomBacteria 
Replicon accessionNC_010552 
Strand
Start bp2259876 
End bp2261045 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content66% 
IMG OID641684844 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001811754 
Protein GI172064103 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.571422 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCCC CCAAGGTCGT GGTCGAAGGT CTGTGCAAGG TATTTGGAAG TAACCCGCAG 
CAGGCGCTCG ACATGCTCGC CGCCGGTGCA ACGAAAGACG ATGTGCTCAA GCGTACCGGT
CAGGTCGTCG GCGTGCACAA CGTATCGTTC GACGTGCAGG AAGGCGAAAT ATTCGTGCTG
ATGGGCCTGT CCGGCTCCGG CAAATCCACG TTGATCCGCC TCGTGAACCG GCTGGTCGAT
CCCAGCGCCG GCAAGGTGCT GATCGACGGG CTCGACGTTG CGTCGGCACG CCGCTCGGAG
CTGACCGCGC TGCGCCGCAA GGACATGAGC ATGGTGTTCC AGTCGTTCGC GCTGATGCCG
CATCGCACCG TGGTGTCGAA CGCCGCGTTC GGCCTCGAGG TCGGCGGCGT CGGCAAGAAG
GAGCGCGAAC GCCGGGCAAT GGACGTGCTC GAGCAGGTCG GTCTCGCACC GTTCGCACAC
AAGCTGCCGT CCGAGCTGTC GGGCGGGATG CAGCAGCGCG TCGGCCTGGC CCGCGCGCTC
GCCGTGAACC CGTCGCTGAT GATCATGGAC GAGGCGTTCT CCGCGCTCGA TCCGCTCAAG
CGCCGCGAGA TGCAGGACGT GCTGCTGCAA CTGCAGAAGG AACAGCGCCG CACGATCATG
TTCGTGTCGC ACGATCTGGA AGAGGCGCTG CGCATCGGCA ACCGCATCGC GATCATGGAA
GGCGGCCGGC TCGTGCAGGT CGGCACGCCG CAGGACATCA TCGCGAACCC GGCCGACGAC
TACGTGCGCG CATTCTTCGA CGGCATCGAC ACCAGCCGCT ACCTCACCGC CGGCGACCTG
ATGCAGACGG GCGCCGTGCC GACCATGTCG AAGTTCGATG CGGCGAACGT CGCGGCGACG
CTGAACGGCA GCGCCGAATA CGCGTTCGTG CTCGACGCCG CACGCAAGAT CCGCGGCTTC
GTCACGCGCG ATGCGCTCGG TCAGGCCACG CCGTCCGTGC GGCCGATCGA AAGCATCCGG
CGCGACGCGA CGCTCGATCA TGTCGTCGCG CGCGTGGTCG CAAGCCCGAA TGCACTGCCC
GTCGTCGACG ACGACGGCTG TTACTGCGGT TCGGTCGACC GCGCACTCAT CCTGAAAGCC
ATCACGCGTT CGCGAGGTTC CCATGTCTGA
 
Protein sequence
MDAPKVVVEG LCKVFGSNPQ QALDMLAAGA TKDDVLKRTG QVVGVHNVSF DVQEGEIFVL 
MGLSGSGKST LIRLVNRLVD PSAGKVLIDG LDVASARRSE LTALRRKDMS MVFQSFALMP
HRTVVSNAAF GLEVGGVGKK ERERRAMDVL EQVGLAPFAH KLPSELSGGM QQRVGLARAL
AVNPSLMIMD EAFSALDPLK RREMQDVLLQ LQKEQRRTIM FVSHDLEEAL RIGNRIAIME
GGRLVQVGTP QDIIANPADD YVRAFFDGID TSRYLTAGDL MQTGAVPTMS KFDAANVAAT
LNGSAEYAFV LDAARKIRGF VTRDALGQAT PSVRPIESIR RDATLDHVVA RVVASPNALP
VVDDDGCYCG SVDRALILKA ITRSRGSHV