Gene B21_04169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04169 
SymbolyjiO 
ID8114991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4474655 
End bp4475887 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content54% 
IMG OID644850312 
Producthypothetical protein 
Protein accessionYP_003001885 
Protein GI251787581 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.596573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGTT TTTTTACCCG CCATGCCGCC ACGCTGTTTT TCCCGATGGC GTTGATTTTG 
TATGACTTTG CTGCGTATCT GTCGACGGAT CTGATCCAGC CTGGGATCAT TAATGTGGTA
CGTGATTTTA ATGCCGATGT CAGTCTGGCC CCTGCTGCCG TCAGTCTCTA TCTTGCTGGC
GGTATGGCGT TACAGTGGCT GCTGGGGCCG CTTTCCGACA GAATTGGCCG CAGGCCGGTG
CTGATTACCG GGGCGCTAAT TTTTACCCTT GCCTGCGCCG CGACAATGTT CACAACGTCT
ATGACACAGT TTCTTATCGC GCGTGCAATT CAGGGCACCA GTATCTGTTT TATTGCCACC
GTTGGTTATG TCACGGTGCA GGAGGCGTTC GGACAGACAA AAGGGATCAA GTTGATGGCG
ATTATCACCT CCATCGTACT GATTGCGCCG ATTATCGGCC CGCTTTCCGG CGCAGCTCTG
ATGCACTTTA TGCACTGGAA AGTCCTTTTT GCCATCATTG CGGTTATGGG TTTTATCTCA
TTTGTTGGCT TACTGTTGGC GATGCCAGAG ACGGTGAAGC GCGGCGCGGT TCCGTTTAGC
GCCAAAAGCG TCTTGCGCGA TTTTCGTAAT GTCTTTTGCA ATCGGCTGTT CCTCTTTGGC
GCAGCAACCA TCTCTTTAAG CTATATCCCG ATGATGAGCT GGGTGGCTGT CTCGCCGGTG
ATCCTTATCG ATGCAGGCAG CTTAACAACT TCGCAGTTCG CCTGGACACA GGTTCCGGTG
TTCGGCGCGG TGATTGTTGC GAATGCCATC GTGGCGCGTT TTGTTAAAGA TCCGACCGAA
CCGCGGTTTA TCTGGCGTGC CGTACCCATT CAACTGGTCG GCCTCTCGCT GTTGATTGTC
GGCAATCTGC TGTCGCCGCA CGTCTGGCTG TGGTCGGTGC TGGGCACCAG TCTGTATGCT
TTCGGGATTG GTTTGATTTT CCCGACCTTA TTCCGCTTTA CGCTGTTTTC CAATAAGTTA
CCGAAAGGGA CCGTCTCCGC ATCGCTAAAT ATGGTGATCC TGATGGTGAT GTCGGTCTCG
GTCGAAATCG GCCGCTGGCT ATGGTTTAAC GGCGGTCGCT TGCCGTTTCA TCTGTTAGCC
GTTGTGGCGG GCGTTATCGT CGTTTTCACC CTGGCGGGAT TGCTCAATCG CGTGCGCCAG
CATCAGGCAG CCGAGCTAGT GGAGGAGCAG TGA
 
Protein sequence
MPRFFTRHAA TLFFPMALIL YDFAAYLSTD LIQPGIINVV RDFNADVSLA PAAVSLYLAG 
GMALQWLLGP LSDRIGRRPV LITGALIFTL ACAATMFTTS MTQFLIARAI QGTSICFIAT
VGYVTVQEAF GQTKGIKLMA IITSIVLIAP IIGPLSGAAL MHFMHWKVLF AIIAVMGFIS
FVGLLLAMPE TVKRGAVPFS AKSVLRDFRN VFCNRLFLFG AATISLSYIP MMSWVAVSPV
ILIDAGSLTT SQFAWTQVPV FGAVIVANAI VARFVKDPTE PRFIWRAVPI QLVGLSLLIV
GNLLSPHVWL WSVLGTSLYA FGIGLIFPTL FRFTLFSNKL PKGTVSASLN MVILMVMSVS
VEIGRWLWFN GGRLPFHLLA VVAGVIVVFT LAGLLNRVRQ HQAAELVEEQ