Gene BURPS668_A1118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A1118 
SymbolcodB 
ID4886574 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp1073163 
End bp1074488 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content67% 
IMG OID640131058 
Productcytosine permease 
Protein accessionYP_001062117 
Protein GI126445343 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1457] Purine-cytosine permease and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCGGC TTTATTGGCC GGGCATCATC CGTTCACGCT GGAATGCTGG AGATCAAAAG 
ACGATGGCAA CACGGGGCGA ATTTTCATTG AGCGAGGTGC CCGCGCACGA GCGCAAAGGC
GCACTGTCGA TCACGATGGT GCTGCTCAGC TTCACGTTCT TCACGGGTAC GATGTTCGCG
GGCGGCAAGA TCGGCGTCGC GTTTCGCGTC GTCGACATGC TGTGGGTCGC GGTCGTCGGC
AATCTGCTGC TTGCCGCCTA TGCGGCCGCG CTCGCGTTCG TCGCGTCGCG CAGCGGGCTC
AATTCGGTGC TGATGGGGCG TTTCTGTTTC GGCGAGGTCG GCAGCAAGCT GTCCGATTTC
CTGCTCGGCT TCGCCGAACT CGGCTGGTAT GCGTGGGGCA CCGCGACGGT GGCGATCGTG
CTCGTCAAGC TGCTCGGCTG GCCCGCGTCG GTGACGACGC CGCTGATGGT GCTGTTCGGG
TTCGGCTTCT CGATTACCGC GATCGTCGGC TATCGCGGGA TGGACGCGCT CTCGCGCGTG
TCGGTGCCGC TGATGTTCGC GCTGCTCGTC GTGTCGATGT GGATCGCCAC GCGCGACGTC
GGCGGCTGGC CGGGCCTCGC GAAGATCGCG CCGACGCAGC CGATGAGCTT CGCCGCCGCG
GTCACGATGG TGTTCGGCAC GTTCGCGAGC GGCGCGACGC AGGCGACGAA CTGGACGCGG
CTCGCGAAGA GCGGCCGCGC GGCCGTCGCG GCGAGCATGA TCGGCTTTTT CGTCGGCAAT
GGGCTGATGA TCGTCGCGGG CGCGTATTGC GCGATCGTCT ATCAGCAGTC CGACATCGTC
GAAGTGATGA TGCTGCAAGG GCTGTCGATC GCGGCCGTCG TGATGCTCTG CCTGAACCTG
TGGACGATTC AGGGGCCGAC GATCTACAAC GTGTCGGCGG CCGCGTGCCA TCTGTTGCGC
AGCGAACGCC GCCGCACGCT GACGCTCGTC GGCGCGGCGG TCGGCATCGT GCTCGCGATC
GGCGGCATGT ACGAGATGCT GATCCCGTTC CTGATCCTGC TCGGCTCGAT CATTCCGCCC
GTCGGCGGCG TGATTCTCGC CGATTTCTGG TATCGGCACC GCGGCCGCTA TCCGGCGATC
GCGAGCGCCC GACTGCCGCG CTTCAATATC GCCGGGCTCG CCGCATATGC GATCGGCGCG
GCGCTCGCGT ACGCATCGCC GTGGATCGCG CCGCTCGTCG GCATCGCCGC GTCGTCGTTC
TGCTACATCG TGTTCGTGCA GATCGCGGGG CGCGCGGTGC GCGCGCCGTC GGTCCAGGGA
GAGTGA
 
Protein sequence
MRRLYWPGII RSRWNAGDQK TMATRGEFSL SEVPAHERKG ALSITMVLLS FTFFTGTMFA 
GGKIGVAFRV VDMLWVAVVG NLLLAAYAAA LAFVASRSGL NSVLMGRFCF GEVGSKLSDF
LLGFAELGWY AWGTATVAIV LVKLLGWPAS VTTPLMVLFG FGFSITAIVG YRGMDALSRV
SVPLMFALLV VSMWIATRDV GGWPGLAKIA PTQPMSFAAA VTMVFGTFAS GATQATNWTR
LAKSGRAAVA ASMIGFFVGN GLMIVAGAYC AIVYQQSDIV EVMMLQGLSI AAVVMLCLNL
WTIQGPTIYN VSAAACHLLR SERRRTLTLV GAAVGIVLAI GGMYEMLIPF LILLGSIIPP
VGGVILADFW YRHRGRYPAI ASARLPRFNI AGLAAYAIGA ALAYASPWIA PLVGIAASSF
CYIVFVQIAG RAVRAPSVQG E