Gene BURPS1106A_A1774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1774 
Symbol 
ID4904136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1746715 
End bp1748745 
Gene Length2031 bp 
Protein Length676 aa 
Translation table11 
GC content70% 
IMG OID640144880 
ProductBCCT family transporter 
Protein accessionYP_001075808 
Protein GI126457740 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0841224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCCAT CCCGCTCCAC GCCGCGCACG ACGCTCAAGC CGCCAGTCGT CGTGCCGTCG 
CTCGCCGTGA TCGGCGCGCT CCTCGCCGTC TGCGCGCTGC GGCCGAACGA GGCCGGCGCC
CTCTTCGGCG CGGCCCAGCA ATGGATCGTC GAGCGATTCG ACTGGTTCTA CGTGCTCGCG
ATCACGACCT TCCTGATCTT CCTCGTGCTG ATCGCCGCGA GCCGCTTCGG CAACATCAAG
CTCGGCCCCG ACGACGCGGA GCCGGAATTC AGCTTCGTGT CGTGGACCGC GATGCTGTTC
GCGGCCGGCA TGGGCATCGG CCTCATGTAC TTCGGCGTCG GCGAGCCGAT CCAGCATTAC
CTGTCGCCGC CGACGGCCGC GGGCGGCACG CCCGCCGCCG CGCGCGAGGC GATGCTGATG
TCGTTCTTCC ATTGGGGCCT GCACGCATGG GCGACCTACG GCGTGATGGG TCTCGTGCTC
GCGTACTTCG GCTTTCGCTA CAACCTGCCG CTCACGCTGC GCTCGGGCCT CTATCCGGTG
CTGCGCGAAG GCGTGAACGG CTGGATCGGG CACACGGTCG ACGCGTTCGC GCTCGTCGGC
ACGGTGGCCG GGATCGCGGT GACGCTCGGC TACGGCGTGC TGCAACTGAG CGCGGGCCTG
CACACGATCG CCGGCTGGCA AACCGATACC GATACGTTCC GCATCGGCCT CGTCACGGTG
GTCGTCGCGC TCGCCGGACT CTCCGCCGCG AGCGGGCTCG ACAAGGGCGT GCGCCGGCTG
TCCGAGCTCA ATCTGATGCT CGCGTTCGCG TTGCTCGCGT TCGTCGTCGT CGCGGGGCCG
ACGTCGTTCC TGCTGCGCGC GATCGGCGAC AACATCGGCG AATATCTGTC GAAGCTCGTG
TCGCTATCGT TTCGCACGTA CGCATACGAA GCGCCGAACG ACAAAGGCTG GTTCGGCGGC
TGGACGCTGC TCTACTGGGC GTGGTGGGTG TCGTGGTCGC CGTTCGTCGG GATGTTCATC
GCCCGGATCT CGCGCGGCCG CACGATCCGC CAGTTCGTGA TCGGCGTGCT GCTCGTGCCC
ACCGCGTTCA ATCTCGTCTG GATGACCGCG TTCGGCAACA GCGCGATCTG GCTCGACACG
CACGCGGCCG CGGGCGCGCT CGCGCAAACG GCGACCAACG TCGACGCGCT GCTGTTTCGC
TTCTTCGATT ACCTGCCGCT CACGCAACTG CTGTCGGTCG CGGCGATCGT GCTGATCGCG
GTGTTCTTCG TCACGTCGGC CGATTCGGGC GCGTTCGTGA TCGACGCGAT CGCCACGCGC
GGCGCGCCGC AATCGCCCGT GTGGCAGCGG CTCTTCTGGG CGGCCGTGCT CGGCGTGACG
GCGTCGGTGC TGCTCGTCGC GGGCGGGCTG AAGGCGCTGC AGGCGCTCAC GCTCGTCGCC
GCGCTGCCCG TCGCGCTCGT GATGCTCGCG CTCTGCTACG GGCTCTGGCG CGGCCTGAAA
GCCGACCACG CGCACTACTC GCAGGACATG GCGCCCGCCA CCAGCTTCTG GACCGGCCAG
CACTGGCGCC ACCGCCTGTC GCAGATCCTG CGGCACACGA GCGACGCCGA CGCGCGGCAG
TTCATCGCGC AGACGGTCGA GCCCGCGCTG CGCAAGGTCG CCGACGAGCT GAAGGCGGGC
GGCGTCGACG CGCACGTCGC GCGCGACGAC GACGACGCGG TGCGCCTCAC GGTGCCCGCG
CCCGCGCAGC GCGATTTCGT CTATGGCGCG CGGGTGTCGC GCAAGTCCGC GCCCGCGTTC
CGCATCCGCG AGGCGGCCGA GCCCGAGCCG CAGCGCGAGC ACGTGTGCGA GGTGCTGACG
TTCTTCGCGG ACGGGCGGCT CGGCTACGAC ATCGAGTACC TGCGCGGCGA CGAGATCATC
GCCGACGTGC TGCGGCAGTA CGAGCGCTAC GTGTCGCTCG CGGCGGACAC GCGCACGCAC
CTGCTGAACC GCGCGCCCGA GCACGCGGGG CCGGACGGTC TGTCGCGGTA G
 
Protein sequence
MPPSRSTPRT TLKPPVVVPS LAVIGALLAV CALRPNEAGA LFGAAQQWIV ERFDWFYVLA 
ITTFLIFLVL IAASRFGNIK LGPDDAEPEF SFVSWTAMLF AAGMGIGLMY FGVGEPIQHY
LSPPTAAGGT PAAAREAMLM SFFHWGLHAW ATYGVMGLVL AYFGFRYNLP LTLRSGLYPV
LREGVNGWIG HTVDAFALVG TVAGIAVTLG YGVLQLSAGL HTIAGWQTDT DTFRIGLVTV
VVALAGLSAA SGLDKGVRRL SELNLMLAFA LLAFVVVAGP TSFLLRAIGD NIGEYLSKLV
SLSFRTYAYE APNDKGWFGG WTLLYWAWWV SWSPFVGMFI ARISRGRTIR QFVIGVLLVP
TAFNLVWMTA FGNSAIWLDT HAAAGALAQT ATNVDALLFR FFDYLPLTQL LSVAAIVLIA
VFFVTSADSG AFVIDAIATR GAPQSPVWQR LFWAAVLGVT ASVLLVAGGL KALQALTLVA
ALPVALVMLA LCYGLWRGLK ADHAHYSQDM APATSFWTGQ HWRHRLSQIL RHTSDADARQ
FIAQTVEPAL RKVADELKAG GVDAHVARDD DDAVRLTVPA PAQRDFVYGA RVSRKSAPAF
RIREAAEPEP QREHVCEVLT FFADGRLGYD IEYLRGDEII ADVLRQYERY VSLAADTRTH
LLNRAPEHAG PDGLSR