Gene B21_04104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04104 
SymbolyjhB 
ID8112940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4412406 
End bp4413623 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content36% 
IMG OID644850251 
Producthypothetical protein 
Protein accessionYP_003001824 
Protein GI251787520 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACAG CATGGTATAA ACAAGTTAAT CCACCACAAC GGAAAGCTCT TTTTTCCGCA 
TGGCTTGGAT ATGTATTTGA TGGCTTTGAT TTTATGATGA TATTTTACAT TCTTCATATT
ATAAAAGCAG ATCTTGGCAT TACGGATATT CAGGCTACTT TAATAGGGAC AGTGGCCTTC
ATAGCCAGAC CTATTGGAGG TGGTTTTTTT GGTGCCATGG CTGATAAATA TGGTCGTAAG
CCAATGATGA TGTGGGCAAT TTTCATTTAC TCAGTCGGAA CAGGCCTTAG CGGTATTGCT
ACAAACTTAT ATATGCTCGC AGTTTGCCGT TTTATTGTTG GCTTAGGGAT GTCTGGTGAA
TATGCATGTG CTTCAACTTA TGCGGTAGAA AGTTGGCCTA AAAATCTTCA ATCTAAAGCT
AGTGCTTTTT TGGTAAGTGG TTTTTCTGTT GGAAATATTA TTGCGGCACA AATAATCCCT
CAGTTTGCTG AAGTATATGG ATGGAGAAAC TCTTTTTTTA TAGGCCTGTT ACCAGTTTTA
CTAGTTCTTT GGATCAGAAA AAGTGCTCCA GAAAGTCAGG AGTGGATTGA AGATAAATAT
AAGGATAAAT CAACATTTTT GTCTGTCTTC AGAAAACCAC ATCTTTCAAT CTCTATGATC
GTTTTCCTCG TCTGTTTTTG TCTATTTGGT GCAAACTGGC CGATAAACGG ACTACTTCCT
TCCTACCTGG CAGATAATGG AGTTAATACA GTGGTCATTT CAACTCTGAT GACAATAGCA
GGTTTAGGAA CACTGACAGG TACAATATTT TTTGGTTTTG TTGGTGATAA GATTGGTGTA
AAAAAAGCCT TTGTAGTCGG TCTAATAACT TCATTTATTT TCCTTTGTCC TCTTTTTTTT
ATTTCTGTGA AAAACTCTTC TCTTATAGGA TTATGTCTCT TTGGATTAAT GTTTACAAAT
TTAGGTATTG CAGGGTTGGT TCCAAAATTT ATATATGATT ACTTTCCAAC AAAATTAAGA
GGATTAGGGA CCGGTCTTAT TTATAACTTA GGGGCAACTG GAGGAATGGC CGCACCTGTA
TTAGCTACAT ACATTTCAGG ATATTATGGC TTAGGTGTTT CATTATTCAT TGTTACGGTT
GCATTCTCTG CCTTATTAAT TTTGTTAGTT GGTTTTGATA TTCCAGGTAA AATTTATAAA
CTATCCGTGG CTAAATGA
 
Protein sequence
MATAWYKQVN PPQRKALFSA WLGYVFDGFD FMMIFYILHI IKADLGITDI QATLIGTVAF 
IARPIGGGFF GAMADKYGRK PMMMWAIFIY SVGTGLSGIA TNLYMLAVCR FIVGLGMSGE
YACASTYAVE SWPKNLQSKA SAFLVSGFSV GNIIAAQIIP QFAEVYGWRN SFFIGLLPVL
LVLWIRKSAP ESQEWIEDKY KDKSTFLSVF RKPHLSISMI VFLVCFCLFG ANWPINGLLP
SYLADNGVNT VVISTLMTIA GLGTLTGTIF FGFVGDKIGV KKAFVVGLIT SFIFLCPLFF
ISVKNSSLIG LCLFGLMFTN LGIAGLVPKF IYDYFPTKLR GLGTGLIYNL GATGGMAAPV
LATYISGYYG LGVSLFIVTV AFSALLILLV GFDIPGKIYK LSVAK