Gene Bcep18194_B2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_B2039 
Symbol 
ID3753804 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007511 
Strand
Start bp2335950 
End bp2337500 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content67% 
IMG OID637766887 
Productcytosine/purines, uracil, thiamine, allantoin transporter 
Protein accessionYP_372796 
Protein GI78062888 
COG category[F] Nucleotide transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG1953] Cytosine/uracil/thiamine/allantoin permeases 
TIGRFAM ID[TIGR00800] NCS1 nucleoside transporter family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.0859563 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.104013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAGGAG ACAACGGCGT GCAATTGGAA GCCACCCACC GCCCGGGCAT CGTGCTCGGC 
GCGGACGACG CATCGTATCG AACCGATTCG GCCGCGCGCG ACCCCGCGCT GAGCCCGCGG
CTGCACAACC CCGACCTCGC ACCCACCAAG GCCGAGGGCC GGACGTGGGG CCGCTACAGC
ATCTTTGCGC TGTGGACCAA CGACGTGCAC AACATCGCGA ACTACTCGTT CGCGATCGGG
CTGTTCGCGC TCGGCCTGTC GGGCTGGCAG ATGCTCGCGT CGCTCGCGAT CGGCGCGGTG
CTCGTGTACT GCTTCATGAA CCTCACCGGC TACATGGGCC AGAAGACCGG CGTGCCGTTC
CCGGTGATCA GCCGGATGAG CTTCGGCATC TACGGCGCGC TGCTGCCCGC AATGATCCGC
GCGGTGATCG CGATCGCATG GTTCGGGATC CAGACCTATC TCGCGTCGGT CGTGCTGCGC
GTGCTGCTCA CCGCGATCTG GCCGAGCCTC GCCGCGTTCG ACCAGAACGC GATCTTCGGG
CTGTCGACGC TCGGCTGGGT CACGTTCGTC GCGATCTGGC TCGTGCAGAT CGGCATCCTC
ACGTACGGGA TGGAAATGGT CCGCAAGTAC GAAGGGCTGG CCGGCCCGGT CATCCTCGTC
ACGACACTGT CGCTCGCCGC ATGGATGTAT AGCCGCACCG GCGGCCATCT CGCGATGTCG
ATCGGCAAGC CGCTGACCGG CTTCAAGATG TGGACGGAGA TCTTCGCGGG CGGCTCGCTG
TGGCTCGCGA TCTACGGCAC GCTGGTGCTC AACTTCTGCG ATTTCGCGCG CTCGTCGCCG
AGCGCGAAGA CGGTGCGCGT GGGCAACTTC TGGGGCCTGC CGGTCAACAT CCTCGTGTTC
GCGACGATCA GCTTCGTGCT CGCCGGCGCG CAGTTCAAGC TGAACGGCCA CATCATCCAC
AGCCCGACGG AAATCATCGC GACGGTGCCG AACAAGCTGT TCCTCGTGCT CGGTTGCCTC
GCGTTCCTGA TCGTGACGGT CGCCGTGAAC ATCATGGCGA ACTTCGTCGC GCCGGCCTTC
GTGCTGACGA GCCTGGCGCC GCATCGCCTG TCGTTCCGCC GCGCGGGCCT GATCAGCGCA
ACGGTCGCCG TGCTGATCCT GCCGTGGAAC CTGTACAACA GCCCGATCGT GATCGTCTAC
TTCCTGTCCG GCCTCGGCGC GCTGCTCGGC CCGCTGTACG GGATCATCAC GGTCGACTAC
TGGCTCGTGC GCAAGCAGCG CGTGAACGTG CCCGACCTCT ATACCGAAGC GCCGACCGGC
ACCTACTTCT ACACGCGCGG CGTGAACCGC AAGGCGCTCG CGGCGCTCGT GCCGTCCGCG
CTGATCTCGA TCACGCTCGC CGTCGTGCCG GCCTTCAGCG CGATGACGCC GTTCTCGTGG
CTGCTCGGCG CGGCCATCGC GGGCACCGTG TACTGGCTGC TGGCCGATCG CAACCGGCAC
TACGAGGAGC GTTCGGGCGA GCCCATCGCG GTCGCCTGCG CCCAACACTG A
 
Protein sequence
MEGDNGVQLE ATHRPGIVLG ADDASYRTDS AARDPALSPR LHNPDLAPTK AEGRTWGRYS 
IFALWTNDVH NIANYSFAIG LFALGLSGWQ MLASLAIGAV LVYCFMNLTG YMGQKTGVPF
PVISRMSFGI YGALLPAMIR AVIAIAWFGI QTYLASVVLR VLLTAIWPSL AAFDQNAIFG
LSTLGWVTFV AIWLVQIGIL TYGMEMVRKY EGLAGPVILV TTLSLAAWMY SRTGGHLAMS
IGKPLTGFKM WTEIFAGGSL WLAIYGTLVL NFCDFARSSP SAKTVRVGNF WGLPVNILVF
ATISFVLAGA QFKLNGHIIH SPTEIIATVP NKLFLVLGCL AFLIVTVAVN IMANFVAPAF
VLTSLAPHRL SFRRAGLISA TVAVLILPWN LYNSPIVIVY FLSGLGALLG PLYGIITVDY
WLVRKQRVNV PDLYTEAPTG TYFYTRGVNR KALAALVPSA LISITLAVVP AFSAMTPFSW
LLGAAIAGTV YWLLADRNRH YEERSGEPIA VACAQH