Gene BURPS1710b_2535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_2535 
Symbol 
ID3689966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp2806294 
End bp2807898 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content65% 
IMG OID637728991 
ProductNCS1 nucleoside transporter family protein 
Protein accessionYP_333927 
Protein GI76810631 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.173195 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGGCG GCCGCGTGTG CGGACGCCTC GCCAGGGCGG CATCGGGCGA TCCCGTTCGA 
GCCGCGATTT CAATGACGAT CGCGCGGCTC GTCCGCGGCA GGCAAGGAGA TGTCATGGCT
CAGTTCAGTG TGGCGCGGCA AAGCGCCTCG TATCGGCCGA ACGAGGATCG CGCCGGCGGC
CCGGATGGCG GCGCTCAGAT GCCCGCCGGC TACAGCAGTC GTCTGTACAA CGAAGATCTC
GCGCCGCTCG CGAGCCAGCG CTGGGGCGCA TACAACATCT TCGCGTTCTG GATGTCGGAC
GTGCACAGCG TCGGCGGCTA CGTGTTTGCG GGCAGCCTGT TCGCGCTCGG TCTGACGAGC
TGGCAGGTGC TCGTCGCGCT GATCGTCGGC ATTTCGATCG TCAACGTGCT GTGCAACCTG
ATCGCGAGGC CGAGCCAGCA GCTAGGCGTG CCGTATCCGG TGGCATGCCG CGCGACGTTC
GGCGTGCTCG GCGCGAACGT GCCCGCGGTG ATCCGCGGCC TCATCGCGAT CGCATGGTAC
GGCATCCAAA CTTATCTGGC GTCGAGCGCG CTCGTGATCG TCGTGCTCAA GTTCTTTCCG
CACTGGATGC CGTACGCGGA CGTGCATCGC TACGGCTTTC TCGGGCTGTC GGCGCTCGGC
TGGGCGGGCT TCATGCTGCT GTGGGTGCTG CAGGCGTTCG TGTTCTGGAA CGGCATGGAG
ACGATCAAGA AGTTCATCGA TTTCGCCGGC CCCGCCGTCT ACGTGGTGAT GTTCGCCCTC
GCGGGCTACA TGGTATGGCG CGCGGGCTGG CGCAATATCG GCCTGAATCT CGGCGGCGTC
CGGTATCACG GCGCCGAAGT GATTCCGGTG ATGGTGACGG CGATCTCGCT CGTCGTGTCG
TATTTCTCGG GGCCGATGCT CAACTTCGGC GATTTCTCGC GTTATTGCAG CAGCTACGGC
GGCGTGAAGC GCGGCAATTT CTGGGGGCTG CCCGTCAATT TCCTCGCGTT CTCGCTCGTC
ACCGTGATCA CGACGGCCGC GACGCTGCCG GTGTTCGGAG AACTGATCAC CGATCCCGTC
GAGACGGTCG GGCGCATCGA TCATCCGAGC GCCGTGATAC TTGGCGCGCT GACCTTCACG
ATCGCGACGA TCGGCATCAA CATCGTCGCG AATTTCGTGT CACCCGCGTT CGATTTCTCG
AACGTCGCGC CGCGCCTGAT CAGTTGGCGC GCGGGCGGGA TGCTCGCGGC GGTCGCATCG
GTGTTCATCA CGCCGTGGAA TCTCTTCAAC AATCCCGCGG TGATCCATTA CACGCTCGAC
GTGCTCGGCG GCTTCATCGG GCCGCTGTAC GGCGTGCTGA TCGTCGATTT CTATCTCGTG
AAGCGCGGCG CGCTGCGGCG CGACGATCTG TACACGACGT CGGCCGACGG CGCGTACTGG
TATCGCGACG GCGTGAACCG GCGCGCGATC GCCGCGCTGT TGCCCGCGGC CGCGATCGCC
GTCGCATGCG TGATGGCGCC CGCGCTGTCC GGGCTCGCGA ATTTCTCGTG GTTCATCGGC
GCGGCGCTCG GCGGCGCGTT CTATCGCGCG CTTGCGAAAG CATGA
 
Protein sequence
MPGGRVCGRL ARAASGDPVR AAISMTIARL VRGRQGDVMA QFSVARQSAS YRPNEDRAGG 
PDGGAQMPAG YSSRLYNEDL APLASQRWGA YNIFAFWMSD VHSVGGYVFA GSLFALGLTS
WQVLVALIVG ISIVNVLCNL IARPSQQLGV PYPVACRATF GVLGANVPAV IRGLIAIAWY
GIQTYLASSA LVIVVLKFFP HWMPYADVHR YGFLGLSALG WAGFMLLWVL QAFVFWNGME
TIKKFIDFAG PAVYVVMFAL AGYMVWRAGW RNIGLNLGGV RYHGAEVIPV MVTAISLVVS
YFSGPMLNFG DFSRYCSSYG GVKRGNFWGL PVNFLAFSLV TVITTAATLP VFGELITDPV
ETVGRIDHPS AVILGALTFT IATIGINIVA NFVSPAFDFS NVAPRLISWR AGGMLAAVAS
VFITPWNLFN NPAVIHYTLD VLGGFIGPLY GVLIVDFYLV KRGALRRDDL YTTSADGAYW
YRDGVNRRAI AALLPAAAIA VACVMAPALS GLANFSWFIG AALGGAFYRA LAKA