Gene BURPS1106A_2434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2434 
Symbol 
ID4900582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp2392493 
End bp2394025 
Gene Length1533 bp 
Protein Length510 aa 
Translation table11 
GC content65% 
IMG OID640135662 
ProductNCS1 nucleoside transporter family protein 
Protein accessionYP_001066694 
Protein GI126452220 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGATCG CGCGGCTCGT CCGCGGCAGG CAAGGAGATG TCATGGCTCA GTTCAGTGTG 
GCGCGGCAAA GCGCCTCGTA TCGGCCGAAC GAGGATCGCG CCGGCGGCCC GGATGGCGGC
GCTCAGATGC CCGCCGGCTA CAGCAGTCGT CTGTACAACG AAGATCTCGC GCCGCTCGCG
AGCCAGCGCT GGGGCGCATA CAACATCTTC GCGTTCTGGA TGTCGGACGT GCACAGCGTC
GGCGGCTACG TGTTTGCGGG CAGCCTGTTC GCGCTCGGTC TGACGAGCTG GCAGGTGCTC
GTCGCGCTGA TCGTCGGCAT TTCGATCGTC AACGTGCTGT GCAACCTGAT CGCGAGGCCG
AGCCAGCAGC TAGGCGTGCC GTATCCGGTG GCATGCCGCG CGACGTTCGG CGTGCTCGGC
GCGAACGTGC CCGCGGTGAT CCGCGGCCTC ATCGCGATCG CATGGTACGG CATCCAAACT
TATCTGGCGT CGAGCGCGCT CGTGATCGTC GTGCTCAAGT TCTTTCCGCA CTGGATGCCG
TACGCGGACG TGCATCGCTA CGGCTTTCTC GGGCTGTCGG CGCTCGGCTG GGCGGGCTTC
ATGCTGCTGT GGGTGCTGCA GGCGTTCGTG TTCTGGAACG GCATGGAGAC GATCAAGAAG
TTCATCGATT TCGCCGGCCC CGCCGTCTAC GTGGTGATGT TCGCCCTCGC GGGCTACATG
GTATGGCGCG CGGGCTGGCG CAATATCGGC CTGAATCTCG GCGGCGTCCG GTATCACGGC
GCCGAAGTGA TTCCGGTGAT GGTGACGGCG ATCTCGCTCG TCGTGTCGTA TTTCTCGGGG
CCGATGCTCA ACTTCGGCGA TTTCTCGCGT TATTGCAGCA GCTACGGCGG CGTGAAGCGC
GGCAATTTCT GGGGGCTGCC CGTCAATTTC CTCGCGTTCT CGCTCGTCAC CGTGATCACG
ACGGCCGCGA CGCTGCCGGT GTTCGGAGAA CTGATCACCG ATCCCGTCGA GACGGTCGGG
CGCATCGATC ATCCGAGCGC CGTGATACTT GGCGCGCTGA CCTTCACGAT CGCGACGATC
GGCATCAACA TCGTCGCGAA TTTCGTGTCG CCCGCGTTCG ATTTCTCGAA CGTCGCGCCG
CGCCTGATCA GTTGGCGCGC GGGCGGGATG CTCGCGGCGG TCGCATCGGT GTTCATCACG
CCGTGGAATC TCTTCAACAA TCCCGCGGTG ATCCATTACA CGCTCGACGT GCTCGGCAGC
TTCATCGGGC CGCTGTACGG CGTGCTGATC GTCGATTTCT ATCTCGTGAA GCGCGGCGCG
CTGCGGCGCG ACGATCTGTA CACGACGTCG GCCGACGGCG CGTACTGGTA TCGCGACGGC
GTGAACCGGC GCGCGATCGC CGCGCTGTTG CCCGCGGCCG CGATCGCCGT CGCATGCGTG
ATGGCGCCCG CGCTGTCCGG GCTCGCGAAT TTCTCGTGGT TCATCGGCGC GGCGCTCGGC
GGCGCGTTCT ATCGCGCGCT CGCGAAAGCA TGA
 
Protein sequence
MTIARLVRGR QGDVMAQFSV ARQSASYRPN EDRAGGPDGG AQMPAGYSSR LYNEDLAPLA 
SQRWGAYNIF AFWMSDVHSV GGYVFAGSLF ALGLTSWQVL VALIVGISIV NVLCNLIARP
SQQLGVPYPV ACRATFGVLG ANVPAVIRGL IAIAWYGIQT YLASSALVIV VLKFFPHWMP
YADVHRYGFL GLSALGWAGF MLLWVLQAFV FWNGMETIKK FIDFAGPAVY VVMFALAGYM
VWRAGWRNIG LNLGGVRYHG AEVIPVMVTA ISLVVSYFSG PMLNFGDFSR YCSSYGGVKR
GNFWGLPVNF LAFSLVTVIT TAATLPVFGE LITDPVETVG RIDHPSAVIL GALTFTIATI
GINIVANFVS PAFDFSNVAP RLISWRAGGM LAAVASVFIT PWNLFNNPAV IHYTLDVLGS
FIGPLYGVLI VDFYLVKRGA LRRDDLYTTS ADGAYWYRDG VNRRAIAALL PAAAIAVACV
MAPALSGLAN FSWFIGAALG GAFYRALAKA