Gene Bcep18194_C6667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcep18194_C6667 
Symbol 
ID3733989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia sp. 383 
KingdomBacteria 
Replicon accessionNC_007509 
Strand
Start bp181255 
End bp182646 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content62% 
IMG OID637760374 
ProductNa+/solute symporter 
Protein accessionYP_366361 
Protein GI78059786 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.471745 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.266697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCTG TGGTATTCAC GGGTTTCATC CTCCTGTCGT TCTGCCTCGC GCTTTACTCG 
CGCCGAGGCG TAGGCAAGCA GAGCGTGCAC GATTTCTTCG TCGCATCGCG GCAGTTCGGT
GCATTTCTCG TCTTCTTCCT GGCAGCGGGC GAGATTTACA GCGTCGCGAC GATGGTCGGC
TTCGCGGGCG GCATCTATGC GAAGGGGCCG ACCTACGGGA TCTGGTTCCT CGGCTACATT
CTGCTCGCCT ACCCGCTCGG CTATTTTCTC GGCCCGAAGA TCTGGGAAGC AGGGCAGCGC
TACAACGCGA TTACGCTTGC GGACCTGTTC GGCGGCTATT TCCGAAGCCG CTCGACCGAG
TTCGTCGTCG CGCTGTCGTC GATCGTGTTT CTGCTGCCGA TGGCTCAACT GCAGTTCACG
GGCCTCGTTG CCGCGTTTCG TGGCCTGGGC TGGCAATTCG AGCCGCTGCA CATGGTGCTG
ATCGCTGGTG TGCTCGCGTT TCTATACATC ATGATCGCCG GTATCCGTTC GTCGGCGTAC
GTCGCCGTGT TGAAGGACAT CCTGATGGTC CTGGCGATCG TGATCACGGG CCTGGCCGTG
GCGGGACACG TCGGCGTGAC GGAGGTGTTC CACGCAGCGA GCCTGCACGT GGGCAACCAG
ATGAATGCGG AACAGCTGCG GTTCTCGATG AGCACGATCC TGCTGCAATC GCTCGGCTTC
CTTGCGATGC CGTTCGGTGT GCAGATTTTC TTCACCGCGA AAAGCGCCGA CACGATCCGA
CGCTCGCAGA TCGCGATGCC GCTCTATATG CTGATGTATC CGTTCCTCGT CATCGCCGCG
TACTACGCAA TCAGCCAGAA CCTGCATCTC CGCTCGCCGA ACGAAGCGTT CTTCGCTGCC
GCGAATGCGT TGCTCCCGTC GTGGATGCTC GGGCTCGTCG CAGCGGCGGC AGCGCTCTCC
GGCCTGCTCG TGCTGACCAG CATGTGTCTC GCGATCGGCC CGATCGTGAG CCGCAACCTG
CTGCCGTCGC TGCCGTCGCA ACGGCAAACA GGGGCGGCGA AGATCGTCAT TTTCGTGTAC
CTCGGTGTGT CCATTGCGAT GACGTCTGCA GCACCCACGC TGATGCTTAC GCTGATCAAC
GTCACCTATT ACGGCGTCAC CCAGTTCTTC CCGGGCCTGA TCGCCGTGCT GTTCTCGCTG
CGTATCCGGC CGGTGGCGGT GACGGCAGGC ATGCTCGTTG GACAAGGTCT CGCGTTGGCG
CTGTATCTCG GGAAGGTTCA GCTCGGCGGC ATCAACCTGG GTTTGCCGTG CCTGGCCGCC
AACATCGCTA CGGTCGCGGC GATCCATTAC CTGCTGGGCG CGGCCAAGCC CCGGACGTTG
GCCTCGCAAT GA
 
Protein sequence
MSSVVFTGFI LLSFCLALYS RRGVGKQSVH DFFVASRQFG AFLVFFLAAG EIYSVATMVG 
FAGGIYAKGP TYGIWFLGYI LLAYPLGYFL GPKIWEAGQR YNAITLADLF GGYFRSRSTE
FVVALSSIVF LLPMAQLQFT GLVAAFRGLG WQFEPLHMVL IAGVLAFLYI MIAGIRSSAY
VAVLKDILMV LAIVITGLAV AGHVGVTEVF HAASLHVGNQ MNAEQLRFSM STILLQSLGF
LAMPFGVQIF FTAKSADTIR RSQIAMPLYM LMYPFLVIAA YYAISQNLHL RSPNEAFFAA
ANALLPSWML GLVAAAAALS GLLVLTSMCL AIGPIVSRNL LPSLPSQRQT GAAKIVIFVY
LGVSIAMTSA APTLMLTLIN VTYYGVTQFF PGLIAVLFSL RIRPVAVTAG MLVGQGLALA
LYLGKVQLGG INLGLPCLAA NIATVAAIHY LLGAAKPRTL ASQ