Gene BURPS1106A_A1032 details


Gene Information

Locus tag: BURPS1106A_A1032
Symbol: codB
ID: 4904063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009078
Strand:
Start bp: 998084
End bp: 999409
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 67%
IMG OID: 640144138
Product: cytosine permease
Protein accession: YP_001075068
Protein GI: 126457523
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1457] Purine-cytosine permease and related proteins
TIGRFAM ID:
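
The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the last codon is a stop codon, as the TGA at the end of the gene sequence suggests. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 998084
end_bp = 999409

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is assumed
# to be a stop codon that contributes no amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, "bp")      # 1326 bp
print(protein_length, "aa")   # 441 aa
```

Both results match the Gene Length and Protein Length fields, and the zero remainder shows the gene length is a whole number of codons.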


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCGGC TTTATTGGCC GGGCATCATC CGTTCACGCT GGAATGCTGG AGATCAAAAG 
ACGATGGCAA CACGGGGCGA ATTTTCATTG AGCGAGGTGC CCGCGCACGA GCGCAAAGGC
GCACTGTCGA TCACGATGGT GCTGCTCAGC TTCACGTTCT TCACGGGTAC GATGTTCGCG
GGCGGCAAGA TCGGCGTCGC GTTTCGCGTC GTCGACATGC TGTGGGTCGC GGTCGTCGGC
AATCTGCTGC TTGCCGCCTA TGCGGCCGCG CTCGCGTTCG TCGCGTCGCG CAGCGGGCTC
AATTCGGTGC TGATGGGGCG TTTCTGTTTC GGCGAGGTCG GCAGCAAGCT GTCCGATTTC
CTGCTCGGCT TCGCCGAACT CGGCTGGTAT GCGTGGGGCA CCGCGACGGT GGCGATCGTG
CTCGTCAAGC TGCTCGGCTG GCCCGCGTCG GTGACGACGC CGCTGATGGT GCTGTTCGGG
TTCGGCTTCT CGATTACCGC GATCGTCGGC TATCGCGGGA TGGACGCGCT CTCGCGCGTG
TCGGTGCCGC TGATGTTCGC GCTGCTCGTC GTGTCGATGT GGATCGCCAC GCGCGACGTC
GGCGGCTGGC CGGGCATCGC GAAGATCGCG CCGACGCAGC CGATGAGCTT CGCCGCCGCG
GTCACGATGG TGTTCGGCAC GTTCGCGAGC GGCGCGACGC AGGCGACGAA CTGGACGCGG
CTCGCGAAGA GCGGCCGCGC GGCCGTCGCG GCGAGCATGA TCGGCTTTTT CGTCGGCAAT
GGGCTGATGA TCGTCGCGGG CGCGTATTGC GCGATCGTCT ATCAGCAGTC CGACATCGTC
GAAGTGATGA TGCTGCAAGG GCTGTCGATC GCGGCCGTCG TGATGCTCTG CCTGAACCTG
TGGACGATTC AGGGGCCGAC GATCTACAAC GTGTCGGCGG CCGCGTGCCA TCTGTTGCGC
AGCGAACGCC GCCGCACGCT GACGCTCGTC GGCGCGGCGG TCGGCATCGT GCTCGCGATC
GGCGGCATGT ACGAGATGCT GATCCCGTTC CTGATCCTGC TCGGCTCGAT CATTCCGCCC
GTCGGCGGCG TGATTCTCGC CGATTTCTGG TATCGGCACC GCGGCCGCTA TCCGGCGATC
GCGAGCGCCC GGCTGCCGCG CTTCAATATC GCCGGGCTCG CCGCATATGC GATCGGCGCG
GCGCTCGCGT ACGCATCGCC GTGGATCGCG CCGCTCGTCG GCATCGCCGC GTCGTCGTTC
TGCTACATCG TGTTCGTGCA GATCGCGGGC CGCGCGGTGC GCGCGCCGTC GGTCCAGGGA
GAGTGA
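
The GC content field can be recomputed from a sequence block like the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the spaces and line breaks in the block are layout only. The helper name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in this
    record) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Small illustrative inputs, not taken from the record.
print(gc_content("ATGC"))        # 50.0
print(gc_content("GGCC GGCC"))   # 100.0
```

Passing the full gene sequence to `gc_content` gives a value that can be compared with the 67% reported in the record.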
 
Protein sequence
MRRLYWPGII RSRWNAGDQK TMATRGEFSL SEVPAHERKG ALSITMVLLS FTFFTGTMFA 
GGKIGVAFRV VDMLWVAVVG NLLLAAYAAA LAFVASRSGL NSVLMGRFCF GEVGSKLSDF
LLGFAELGWY AWGTATVAIV LVKLLGWPAS VTTPLMVLFG FGFSITAIVG YRGMDALSRV
SVPLMFALLV VSMWIATRDV GGWPGIAKIA PTQPMSFAAA VTMVFGTFAS GATQATNWTR
LAKSGRAAVA ASMIGFFVGN GLMIVAGAYC AIVYQQSDIV EVMMLQGLSI AAVVMLCLNL
WTIQGPTIYN VSAAACHLLR SERRRTLTLV GAAVGIVLAI GGMYEMLIPF LILLGSIIPP
VGGVILADFW YRHRGRYPAI ASARLPRFNI AGLAAYAIGA ALAYASPWIA PLVGIAASSF
CYIVFVQIAG RAVRAPSVQG E
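
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ in their permitted start codons), so a standard codon table is used here. The function name `translate` is hypothetical, and the sketch only handles the unambiguous bases A, C, G and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCGGCGGC TTTATTGGCC GGGCATCATC CGTTCACGCT GGAATGCTGG AGATCAAAAG"
print(translate(first_line))  # MRRLYWPGIIRSRWNAGDQK
```

The output matches the first twenty residues of the protein sequence (MRRLYWPGII RSRWNAGDQK). The final two codons of the gene, GAG TGA, translate to E followed by a stop, which agrees with the protein ending in E.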