Gene B21_02018 details

Gene Information

Locus tag: B21_02018
Symbol: yehY
ID: 8114681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2111576
End bp: 2112733
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 58%
IMG OID: 644848230
Product: hypothetical protein
Protein accession: YP_002999803
Protein GI: 251785499
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1174] ABC-type proline/glycine betaine transport systems, permease component
TIGRFAM ID:
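
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS is an unspliced run of whole codons ending in a single stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2111576
end_bp = 2112733

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS consists of whole codons and ends with one stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1158 bp) and Protein Length (385 aa).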


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTTATT TCCGTATTAA TCCTGTTCTG GCGCTGCTGC TGTTGCTGAC GGCAATCGCA 
GCGGCGCTGC CGTTTATCAG TTACGCGCCT AATCGTTTAG TTTCGGGTGA GGGGCGTCAC
CTCTGGCAGC TGTGGCCGCA AACGATCTGG ATGCTGGTGG GCGTTGGTTG CGCCTGGCTG
ACGGCCTGTT TTATTCCCGG TAAAAAAGGC AGCATTTGTG CACTCATTCT GGCGCAATTC
GTCTTCGTAT TGCTGGTGTG GGGAGCTGGA AAGGCGGCGA CCCAACTGGC GCAAAATGGC
AGTGCGCTGG CGCGTACCAG CCTCGGCAGT GGTTTCTGGC TGGCTGCGGC GCTGGCATTG
CTGGCCTGTA GCGATGCCAT CCGCCGAATC TCCACGCATC CGCTGTGGCG CTGGTTGTTG
CATATGCAGA TTGCCATTAT TCCGCTGTGG TTGCTGTACT CCGGCACGCT TAACGATCTC
TCACTAATGA AAGAATACGC CAACCGTCAG GATGTGTTTG ACGACGCGCT GGCACAACAT
CTGACGTTGC TGTTTGGTGC GGTGCTGCCT GCGTTAGTGA TTGGTGTGCC GTTGGGCATC
TGGTGCTACT TTTCCACTGC TCGGCAGGGG GCAATTTTTT CTCTGCTCAA TGTCATTCAG
ACCGTGCCTT CGGTGGCGCT CTTTGGCCTG TTGATTGCGC CGCTTGCCGC GCTGGTGACG
GCCTTTCCGT GGCTGGGGAA GCTCGGCATA GCAGGAACCG GAATGACACC CGCACTGATT
GCGCTGGTGC TCTATGCCTT GCTGCCGCTG GTGCGCGGCG TGGTAGTCGG CTTGAACCAG
ATCCCGCGCG ATGTGCTGGA GAGCGCCAGA GCGATGGGCA TGAGCGGGGC GCGGCGATTC
CTGCATGTTC AGTTACCACT GGCGTTACCG GTATTTCTGC GCAGCCTGCG GGTGGTGATG
GTGCAAACTG TAGGTATGGC GGTGATTGCG GCGTTAATCG GCGCAGGCGG TTTTGGTGCG
CTGGTTTTCC AGGGGCTGCT AAGCAGCGCC ATTGATTTAG TGTTGCTGGG GGTGATCCCG
GTAATTGTTC TGGCGGTGCT TACCGACGCG CTGTTCGATT TGCTTATCGC ACTGCTGAAG
GTGAAACGTA ATGATTGA
 
Protein sequence
MTYFRINPVL ALLLLLTAIA AALPFISYAP NRLVSGEGRH LWQLWPQTIW MLVGVGCAWL 
TACFIPGKKG SICALILAQF VFVLLVWGAG KAATQLAQNG SALARTSLGS GFWLAAALAL
LACSDAIRRI STHPLWRWLL HMQIAIIPLW LLYSGTLNDL SLMKEYANRQ DVFDDALAQH
LTLLFGAVLP ALVIGVPLGI WCYFSTARQG AIFSLLNVIQ TVPSVALFGL LIAPLAALVT
AFPWLGKLGI AGTGMTPALI ALVLYALLPL VRGVVVGLNQ IPRDVLESAR AMGMSGARRF
LHVQLPLALP VFLRSLRVVM VQTVGMAVIA ALIGAGGFGA LVFQGLLSSA IDLVLLGVIP
VIVLAVLTDA LFDLLIALLK VKRND
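
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 treating GTG as an initiation codon. The sketch below is a minimal, dependency-free illustration of that relationship. The function name translate_cds is hypothetical, the compact codon string follows the common T, C, A, G ordering, and the start-codon set is an assumption based on the NCBI definition of table 11. It is not the pipeline that produced this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Assumption: initiation codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiation codon in first position as M."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        # str.index raises ValueError for characters outside T, C, A, G.
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = "GTGACTTATT TCCGTATTAA TCCTGTTCTG GCGCTGCTGC TGTTGCTGAC GGCAATCGCA"
print(translate_cds(first_line))
```

Applied to the first line of the gene sequence, this yields the first twenty residues of the protein sequence (MTYFRINPVL ALLLLLTAIA, without the spacing).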