Gene BURPS1106A_4012 details

Gene Information

Locus tag: BURPS1106A_4012
Symbol:
ID: 4901084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 3916143
End bp: 3917552
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 68%
IMG OID: 640137238
Product: putative ethanolamine permease
Protein accession: YP_001068231
Protein GI: 126454164
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0531] Amino acid transporters
TIGRFAM ID: [TIGR00908] ethanolamine permease
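
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(3916143, 3917552)
print(length)                           # 1410
print(expected_protein_length(length))  # 469
```

Both values agree with the Gene Length and Protein Length fields of this record.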


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGACAG AGTCGAATGG CCGCCCCGGC GCGGGCGGTG GCGCAGCGCG GCCGGCGCTT 
CAGCAGACGC TCGGCACGTG GCAGCTGTGG GGAATTGCCG TCGGCCTCGT GATTTCGGGC
GAGTACTTCG GCTGGAGTTA CGGCTGGGCG AGCGCGGGCA CGCTCGGCTT CGTCGTCACC
GCGCTGTTCG TCGCGGCGAT GTACACGACC TTCATCTTCA GCTTCACCGA GCTCACGACG
TCGATTCCGC ACGCGGGCGG CCCGTTCGCC TATGCGCGGC GCGCGTTCGG CCCGGCGGGC
GGCTATCTGG CGGGCGTCGC GACCCTTGTC GAGTTCGTGT TCGCGCCGCC CGCGATCGCG
CTCGCGATCG GCGCGTACCT GCACGTGCAG TTTTCCGGCC TCGAGCCGAA GCACGCGGCG
ATGGGCGCGT ACCTCGTGTT CATGGCGCTG AATATCGTCG GCGTGCAGAT CGCCGCGACG
TTCGAGCTCG TCGTCACGCT GCTCGCGATC TTCGAGCTGC TCGTGTTCAT GGGCGTCGTA
TCGCCGGGCT TCGCCTGGAG CAACTTCGTG AAGGGCGGCT GGGCGGGCGC CGATCACTTC
AGCGCCGGCG CGTTCCATGG CATGTTCGCG GCGATCCCGT TCGCGATCTG GTTCTTCCTC
GCGATCGAGG GTGTCGCGAT GGCGGCCGAG GAGGCGAAGC ACCCGAAACG CTCGATTCCG
ATCGCGTACG TGGCCGGCAT CCTGACGCTC GTGGCGCTCG CGATCGGCGT GATGGTGTTC
GCGGGCGGCG CGGGCGACTG GACCAAGCTC GCGAATATCA ACGATCCGCT GCCGCAGGCG
ATGAAGTACA TCGTCGGCGC GAACAGCGGC TGGATGCACA TGCTCGTGTG GCTCGGCCTG
TTCGGCCTCG TCGCGTCGTT CCACGGGATC ATTCTCGGCT ATTCGCGCCA GATCTTCGCG
CTCGCCCGCG AAGGTTACCT GCCCGAATGG CTCGCGAAGG TGCACCCGCG CTTCAAGACG
CCTTATCGCG CGATCCTCGC GGGCGGCGTG GTCGGCATCG CCGCGATCTA CAGCGACGAG
CTGATCCAGT TCGGCGGCCA GACGCTCACC GCGAACATCG TGACGATGTC CGTGTTCGGC
GCTATCGTGA TGTACATCGT CAGCATGGCC GCGCTCTTCA AGCTGCGCCG CGTGCAGCCG
AGGATGGAGC GCCCGTTCCG CGCGCCGCTG TATCCGTTCT TCCCGGCGTT CGCGCTCGTC
GCGGCGCTCG TGTGCCTCGG CACGATGGTG TACTTCAACG CGCTCGTCGC GTCGATCTTC
GTCGCGTTCG TCGCGCTCGG GTACGGCTAC TTCCTCGCGA CGCGCGCGCA GCGCGAGGCC
GCGCCCGCCG ACGCGCTGCT CGAGGAGTAG
 
Protein sequence
MQTESNGRPG AGGGAARPAL QQTLGTWQLW GIAVGLVISG EYFGWSYGWA SAGTLGFVVT 
ALFVAAMYTT FIFSFTELTT SIPHAGGPFA YARRAFGPAG GYLAGVATLV EFVFAPPAIA
LAIGAYLHVQ FSGLEPKHAA MGAYLVFMAL NIVGVQIAAT FELVVTLLAI FELLVFMGVV
SPGFAWSNFV KGGWAGADHF SAGAFHGMFA AIPFAIWFFL AIEGVAMAAE EAKHPKRSIP
IAYVAGILTL VALAIGVMVF AGGAGDWTKL ANINDPLPQA MKYIVGANSG WMHMLVWLGL
FGLVASFHGI ILGYSRQIFA LAREGYLPEW LAKVHPRFKT PYRAILAGGV VGIAAIYSDE
LIQFGGQTLT ANIVTMSVFG AIVMYIVSMA ALFKLRRVQP RMERPFRAPL YPFFPAFALV
AALVCLGTMV YFNALVASIF VAFVALGYGY FLATRAQREA APADALLEE
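
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, checked for GC content, and translated is given below. It is an illustration, not the pipeline used to produce this record. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs only in its set of permitted start codons, and the sketch ignores that because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = clean_sequence("ATGCAGACAG AGTCGAATGG CCGCCCCGGC")
print(translate(prefix))  # MQTESNGRPG
```

The translated prefix matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.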