Gene Bcep18194_C7553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C7553 
Symbol 
ID3733841 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp1180214 
End bp1181389 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content68% 
IMG OID637761254 
ProductABC glycine betaine/L-proline transporter, ATPase subunit 
Protein accessionYP_367241 
Protein GI78060666 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.588989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGAAC TCGACAAACT GACCAAGACC TTCTCCCGCA AGGACGGCCA GGCCGTGCGC 
GCCGTCGACG CCGTGAGCCT GTCGGTGGCC GAGGGGGAAA TCTGCGTGTT CCTCGGCCCG
TCGGGCTGCG GCAAGACCAC CACGCTCAAG ATGATCAACC GGCTGATCGC CCCCACGTCC
GGCCGCGTGC TGATCAACGG CGAGGATACG GCGCAGCTCG ACGAGGTCGA CCTGCGGCGC
CACATCGGCT ACGTGATCCA GCAGATCGGG TTGTTCCCGA ACATGACGAT CGAGGAAAAC
ATCACCGTCG TGCCGCGCCT GCTGGGCTGG GACAAGAAGC GCTGCGCCGA ACGCGCGCGC
GACCTGATGG CGATGGTCGC GCTCGATCCG AAGCTGTACC TGAAACGCTA TCCGCGCGAG
CTCTCGGGCG GGCAGCAGCA GCGTATCGGC GTGATTCGCG CGCTGGCCGC CGATCCGCCG
GTGCTGCTGA TGGACGAGCC GTTCGGCGCG GTCGACCCGA TCAATCGCGA ATCGATCCAG
AACGAGTTTT TCCAGATGCA GCGGCAACTG AACAAGACCG TGATCATGGT GAGCCACGAT
ATCGACGAAG CGATCAAGCT GGGCGATCGC ATCGCCGTGT TCCGGCGCGG CCAGCTCGTG
CAATACGATC ACCCCGACAC GCTGCTCGCA CGCCCGCGCG ACGAGTTCGT CGCGCAGTTC
GTGGGCCAGG ACAGCACGCT CAAGCGCCTG CTGCTCGTGA AGGCCGGTGA CGCCGCCACG
CAACCCGAAA CGGCACGGGT CGATACGCCG CTCGCGCACG CGCTCGCCGT GATGGACGAA
ACCGACTGCC GCTACCTGAC CGTGCTCGAC GACGCGGGCC GCGCGCTCGG CTACGTGACG
CGCCGCGCGG CGCGCTCCGA CGGCGGCGTG TGCGGCGAGC GCGTGACGCC GTTCCCGGCC
GGTGTCGCGG CGGACGACAA CCTGCGCATC GTGCTGTCGA AGATGTACCA GTACAGCGCG
TCGTGGATGC CGGTGCTCGA CCCCGACGGC GCATGGCTCG GCGAGATCAC GCAGGACTCG
ATCGCCGCGT ACCTCAGCTC GGGCCGCTCG CGCCGGCAGA CGGGCCAGCC GCCCGGCAGC
CCTGCCGAAG CGGTCGTGGC AACCGCCACG CACTGA
 
Protein sequence
MIELDKLTKT FSRKDGQAVR AVDAVSLSVA EGEICVFLGP SGCGKTTTLK MINRLIAPTS 
GRVLINGEDT AQLDEVDLRR HIGYVIQQIG LFPNMTIEEN ITVVPRLLGW DKKRCAERAR
DLMAMVALDP KLYLKRYPRE LSGGQQQRIG VIRALAADPP VLLMDEPFGA VDPINRESIQ
NEFFQMQRQL NKTVIMVSHD IDEAIKLGDR IAVFRRGQLV QYDHPDTLLA RPRDEFVAQF
VGQDSTLKRL LLVKAGDAAT QPETARVDTP LAHALAVMDE TDCRYLTVLD DAGRALGYVT
RRAARSDGGV CGERVTPFPA GVAADDNLRI VLSKMYQYSA SWMPVLDPDG AWLGEITQDS
IAAYLSSGRS RRQTGQPPGS PAEAVVATAT H