Gene BURPS1710b_A2333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_A2333 
SymbolcodB 
ID3692805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007435 
Strand
Start bp2837413 
End bp2838675 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content67% 
IMG OID637732587 
Productcytosine permease 
Protein accessionYP_337484 
Protein GI162210108 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAC AGGGCGAATT TTCATTGAGC GAGGTGCCCG CGCACGAGCG CAAAGGCGCA 
CTGTCGATCA CGATGGTGCT GCTCAGCTTC ACGTTCTTCA CGGGTACGAT GTTCGCGGGC
GGCAAGATCG GCGTCGCGTT TCGCGTCGTC GACACGCTGT GGGTCGCGGT CGTCGGCAAT
CTGCTGCTTG CCGCCTATGC GGCCGCGCTC GCGTTCGTCG CGTCGCGCAG CGGGCTCAAT
TCGGTGCTGA TGGGGCGTTT CTGTTTCGGC GAGGTCGGCA GCAAGCTGTC CGATTTCCTG
CTCGGCTTCG CCGAACTCGG CTGGTATGCG TGGGGCACCG CGACGGTGGC GATCGTGCTC
GTCAAGCTGC TCGGCTGGCC CGCGTCGGTG ACGACGCCGC TGATGGTGCT GTTCGGGTTC
GGCTTCTCGA TTACCGCGAT CGTCGGCTAT CGCGGGATGG ACGCGCTCTC GCGCGTGTCG
GTGCCGCTGA TGTTCGCGCT GCTCGTCGTG TCGATGTGGA TCGCCACGCG CGACGTCGGC
GGCTGGCCGG GCCTCGCGAA GATCGCGCCG ACGCAGCCGA TGAGCTTCGC CGCCGCGGTC
ACGATGGTGT TCGGCACGTT CGCGAGCGGC GCGACGCAGG CGACGAACTG GACGCGGCTC
GCGAAGAGCG GCCGCGCGGC CGTCGTGGCG AGCATGATCG GCTTTTTCGT CGGCAATGGG
CTGATGATCG TCGCGGGCGC GTATTGCGCG ATCGTCTATC AGCAGTCCGA CATCGTCGAA
GTGATGATGC TGCAAGGGCT GTCGATCGCG GCCGTCGTGA TGCTCTGCCT GAACCTGTGG
ACGATTCAGG GGCCGACGAT CTACAACGTG TCGGCGGCCG CGTGCCATCT GTTGCGCAGC
GAACGCCGCC GCACGCTGAC GCTCGTCGGC GCGGCGGTCG GCATCGTGCT CGCGATCGGC
GGCATGTACG AGATGCTGAT CCCGTTCCTG ATCCTGCTCG GCTCGATCAT TCCGCCCGTC
GGCGGCGTGA TTCTCGCCGA TTTCTGGTAT CGGCACCGCG GCCGCTATCC GGCGATCGCG
AGCGCCCGGC TGCCGCGCTT CAATATCGCC GGGCTCGCCG CATATGCGAT CGGCGCGGCG
CTCGCGTACG CATCGCCGTG GATCGCGCCG CTCGTCGGCA TCGCCGCGTC GTCGTTCTGC
TACATCGTGT TCGTGCAGAT CGCGGGCCGC GCGGTGCGCG CGCCGTCGGT CCAGGGAGAG
TGA
 
Protein sequence
MATQGEFSLS EVPAHERKGA LSITMVLLSF TFFTGTMFAG GKIGVAFRVV DTLWVAVVGN 
LLLAAYAAAL AFVASRSGLN SVLMGRFCFG EVGSKLSDFL LGFAELGWYA WGTATVAIVL
VKLLGWPASV TTPLMVLFGF GFSITAIVGY RGMDALSRVS VPLMFALLVV SMWIATRDVG
GWPGLAKIAP TQPMSFAAAV TMVFGTFASG ATQATNWTRL AKSGRAAVVA SMIGFFVGNG
LMIVAGAYCA IVYQQSDIVE VMMLQGLSIA AVVMLCLNLW TIQGPTIYNV SAAACHLLRS
ERRRTLTLVG AAVGIVLAIG GMYEMLIPFL ILLGSIIPPV GGVILADFWY RHRGRYPAIA
SARLPRFNIA GLAAYAIGAA LAYASPWIAP LVGIAASSFC YIVFVQIAGR AVRAPSVQGE