Gene BURPS1106A_A2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2667 
Symbol 
ID4904203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2604844 
End bp2606544 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content69% 
IMG OID640145770 
ProductNa+ dependent nucleoside transporter family protein 
Protein accessionYP_001076697 
Protein GI126458076 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0472301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCGGATA GCGCGCGCCC GGCGCGCCCG ACGAGCGCAT CCGCTTTGAA GCGTTCGGGT 
CGTCGGGCAT TCGCCGGCCG CGCATCGCCA TCGCCATCGC CATCGCCATC GCCATCGCCA
TCGCCATCGC CATCGCCATC GCCATCGCCA TCGCCATCGC CATCGCCATC GCCATCGCCA
TCGCCCCGGA AAAATTCCCC GTCTCTACCT CGCCAAAATC CCGCCGACAA CCAAAATCGC
TCGCCGCCCG CGCGAGCGCG CATCTTTCGT TCCCCGATCG ATTCGACCAC TGGCGTTCGC
CGTCCGCGCA CGTTATGTTC TCGTCCGATG AATAAACGGC GCGCGCGCGC CCGACCGGAG
ACAGCCGGCG GCCGGCCCGG TTGCGCCGGC GGCGGCGCTT TCCATCCCAT CCTCCGAGGT
ACGTTCGTGG ACATCTTGCG CAGCTTGTGT GGCATCGTGG TATTGCTCGG CGTCGGTTAT
GCGCTGTCGG TCAACCGGCG GGCGATCAGC GCCCGCACCG TCGTCGCTGC GCTCGCGACG
CAGCTCGCGA TCGGCGCGCT CGTGCTGTTC GTGCCCGTCG GCCGCGACGC GCTCGCCGGC
ACCGCGCATG CGGTCAACAG CGTGCTCGAG ATGGGGCAGC ACGGCGTCGC GTTCCTGTTC
GGCGGCCTCG TCGGCGACAA GATGTTCGCG CTCTTCGGCG ACGGCGGCTT CGCGTTCGCG
CTGCGCGTGC TGCCGATGAT CGTGTTCGTC ACGTCGCTGA TCGCGGTGCT CTATTACATC
GGCGTGATGA AGTGGCTGAT CCGCATCGTC GGCACCGCGA TGGCGAAGCT GCTCGGCGTG
AGCCGCATCG AGGCGTGCTC GGCCGTGGCG ACGATCTTCC TCGGCCAGAG CGAGATGCCC
GCGTTCGTGA AGCCGTTCGT GCGGCGGATG AGCGGCACCG AAGTGTTCGC GGTGATGTCG
AGCGGCATGG CGTCGGTCGC GGGTTCGGTG CTCGCCGGCT ACGCGGGGCT CGGCGTGAAG
ATGGAGTATC TGCTCGCCGC GTCGTTCATG GCGATCCCGG GCGGCCTGCT GTTCGGCAAG
ATGCTGTGCC CGACGACGGA GCCCTCGCGC GTCGCCGTCG ACTCGCTCGA GTTCGACGAG
AAGCGCGCGG CGAACGTGAT CGAGGCGGCC GCCTCCGGCG CGGGCGTCGG GATGCGCATC
GCGGTGAACG TCGGTACCAT GCTGATCGCG TTCATCGGCC TCATCGCGCT GCTCAACGCG
ATGGTCGGGC TCGCCGCCGG CTGGCTCGGC TTCGCCGGCG TCACGCTGCA ATCGCTGCTC
GGCGCGCTGT TCTCGCCGCT CGCGTGGCTG ATCGGCGTGC CGTGGCGGGA TGCGCCGGTG
GCCGGCAGCT TCATCGGCCA GAAGCTGATC CTGAACGAGT TCGTCGCGTA CGGCGCGCTG
TCGCCCTATC TGAAGGATGC CGCGCAGGTC GTCGCGGCCG GCTTGCCGGT GCTCGCGCCG
AAGACGATCG CGATCGTGTC GTTCGCGCTG TGCGGCTTCG CGAATTTCTC GTCGATCGCG
ATCCTGACGG GCGGCTTCAC CGCGGTCGAG CCCGGCATGC GCTCCGAAGT CGCGCGCTAC
GGCCTGCGCG CGCTCGCGGC CGCGACGCTG TCGAACCTGA TGAGCGCGAC GATCGCCGGG
CTGTTCCTGT CCCTTTCCTG A
 
Protein sequence
MADSARPARP TSASALKRSG RRAFAGRASP SPSPSPSPSP SPSPSPSPSP SPSPSPSPSP 
SPRKNSPSLP RQNPADNQNR SPPARARIFR SPIDSTTGVR RPRTLCSRPM NKRRARARPE
TAGGRPGCAG GGAFHPILRG TFVDILRSLC GIVVLLGVGY ALSVNRRAIS ARTVVAALAT
QLAIGALVLF VPVGRDALAG TAHAVNSVLE MGQHGVAFLF GGLVGDKMFA LFGDGGFAFA
LRVLPMIVFV TSLIAVLYYI GVMKWLIRIV GTAMAKLLGV SRIEACSAVA TIFLGQSEMP
AFVKPFVRRM SGTEVFAVMS SGMASVAGSV LAGYAGLGVK MEYLLAASFM AIPGGLLFGK
MLCPTTEPSR VAVDSLEFDE KRAANVIEAA ASGAGVGMRI AVNVGTMLIA FIGLIALLNA
MVGLAAGWLG FAGVTLQSLL GALFSPLAWL IGVPWRDAPV AGSFIGQKLI LNEFVAYGAL
SPYLKDAAQV VAAGLPVLAP KTIAIVSFAL CGFANFSSIA ILTGGFTAVE PGMRSEVARY
GLRALAAATL SNLMSATIAG LFLSLS