Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_I2065 |
Symbol | |
ID | 3848967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | + |
Start bp | 2338228 |
End bp | 2339718 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637841734 |
Product | NCS1 nucleoside transporter family protein |
Protein accession | YP_442589 |
Protein GI | 83719446 |
COG category | [F] Nucleotide transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG1953] Cytosine/uracil/thiamine/allantoin permeases |
TIGRFAM ID | [TIGR00800] NCS1 nucleoside transporter family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCAGT TCAGTGTGGC GCAGCAAAGC GCCTCGTATC GGCCGAACGA GGGGCGCCCG GGCGACCCGG ACGGCGGCGC GGCGATGCCC GCCGGCTACA GCAAACGTCT GTACAACGAA GATCTCGCGC CGCTCGCGAA CCAGAACTGG GGGGCATATA ACATCTTCGC GTTCTGGATG TCGGACGTGC ACAGCGTCGG CGGCTACGTG TTCGCGGGCA GCCTGTTCGC GCTCGGCCTG ACGAGCTGGC AGGTGCTCGC CGCGCTGATC GTCGGCATTT CGATCGTCAA CGTGCTGTGC AACCTGATCG CGAAGCCGAG CCAGCAGCTC GGCGTGCCGT ATCCGGTGGC ATGCCGCGCG ACGTTCGGCG TGCTCGGGGC GAACGTGCCC GCGGTGATTC GCGGCCTCAT CGCGATCGCA TGGTACGGAA TCCAAACTTA CCTCGCGTCG AGCGCGCTCG TGATCGTCGT GCTCAAGTTC TTTCCGCAAT GGATGCCGTA CGCGGACGTG CATCGCTATG GCTTTCTCGG GCTGTCGGCG CTCGGCTGGG CGGGCTTCAT GCTGCTGTGG GTGCTGCAGG CGTTCGTGTT CTGGAACGGC ATGGAGACGA TCAAGAAGTT CATCGATTTC GCCGGTCCTG CGGTGTACGT GGTGATGTTC GTCCTCGCGG GCTACATGGT GTGGCGCGCG GGCTGGCGCA ACATTGGCCT CGATCTCGGC GGCGTCAAGT ATCACGGCGC CGAAGTGCTT CCGGTGATGG TGACGGCGAT CTCGCTCGTC GTGTCGTATT TCTCCGGACC GATGCTCAAC TTCGGCGACT TCTCGCGGTA CTGCAGGAGC TACGGCAGCG TGAAGCGCGG CAATTTCTGG GGATTGCCCG TCAACTTTCT CGCGTTCTCG CTCGTCACCG TCATCACGAC GGCCGCGACG CTGCCGGTGT TCGGACAACT GATCACGGAC CCCGTCGAGA CGGTCGGCCG CATCGATCAT CCGACCGCCG TGATTCTCGG CGCGCTGACC TTCACGATCG CGACGATCGG CATCAACATC GTCGCGAACT TCGTGTCGCC CGCATTCGAC TTCTCGAACG TCGCGCCGCG CCTGATCAGC TGGCGTGCGG GCGGGATGCT CGCGGCGGTT GCCTCGGTGT TCATCACGCC GTGGAATCTC TTCAACAATC CCGCGGTGAT CCATTACACG CTCGACGTGC TCGGCAGCTT CATCGGGCCG CTGTATGGCG TGCTGATCGT CGATTTCTAC CTCGTGAAGC GCGGCGCGCT GCGGCGCGAC GATCTGTATA CGACGTCGGC CGACGGCGCG TACTGGTATC GCGACGGCGT GAACCGGCGC GCGGTCGCCG CGTTGTTGCC CGCGGCCGCG ATCGCGGTTG CGTGCGTGAT GGTGCCCGCG CCGTCCGGGC TCGCGAACTT CTCCTGGTTC ATCGGCGCGG CGCTCGGCGG TGCGTTCTAC CGCGCGCTTG CGAAAGCATG A
|
Protein sequence | MAQFSVAQQS ASYRPNEGRP GDPDGGAAMP AGYSKRLYNE DLAPLANQNW GAYNIFAFWM SDVHSVGGYV FAGSLFALGL TSWQVLAALI VGISIVNVLC NLIAKPSQQL GVPYPVACRA TFGVLGANVP AVIRGLIAIA WYGIQTYLAS SALVIVVLKF FPQWMPYADV HRYGFLGLSA LGWAGFMLLW VLQAFVFWNG METIKKFIDF AGPAVYVVMF VLAGYMVWRA GWRNIGLDLG GVKYHGAEVL PVMVTAISLV VSYFSGPMLN FGDFSRYCRS YGSVKRGNFW GLPVNFLAFS LVTVITTAAT LPVFGQLITD PVETVGRIDH PTAVILGALT FTIATIGINI VANFVSPAFD FSNVAPRLIS WRAGGMLAAV ASVFITPWNL FNNPAVIHYT LDVLGSFIGP LYGVLIVDFY LVKRGALRRD DLYTTSADGA YWYRDGVNRR AVAALLPAAA IAVACVMVPA PSGLANFSWF IGAALGGAFY RALAKA
|
| |