Gene B21_03051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03051 
SymbolaaeB 
ID8114034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3246591 
End bp3248558 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content52% 
IMG OID644849235 
Producthypothetical protein 
Protein accessionYP_003000808 
Protein GI251786504 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATTT TCTCCATTGC TAATCAACAT ATTCGCTTTG CGGTAAAACT GGCGACCGCC 
ATTGTACTGG CGCTGTTTGT TGGCTTTCAC TTCCAGCTGG AAACGCCACG CTGGGCGGTA
CTGACAGCGG CGATTGTTGC TGCCGGTCCG GCCTTTGCTG CGGGAGGTGA ACCGTACTCT
GGCGCGATAC GCTATCGTGG CTTTTTGCGC ATCATCGGCA CATTTATTGG CTGTATTGCC
GGACTGGTGA TCATCATTGC GATGATCCGC GCACCATTAT TGATGATTCT GGTGTGCTGT
ATCTGGGCCG GTTTTTGTAC CTGGATATCC TCGCTGGTAC GAATAGAAAA CTCGTATGCG
TGGGGGCTGG CCGGTTATAC CGCGCTGATC ATTGTGATCA CCATTCAGCC GGAACCATTG
CTTACGCCGC AGTTTGCCGT TGAACGTTGT AGCGAGATCG TTATCGGTAT TGTGTGTGCC
ATTATGGCGG ATTTGCTCTT TTCTCCGCGA TCGATCAAAC AAGAAGTGGA TCGAGAGCTG
GAAAGTTTGC TGGTCGCGCA ATATCAATTA ATGCAACTCT GTATCAAGCA TGGCGATGGT
GAAGTTGTTG ATAAAGCCTG GGGCGACCTG GTGCGACGCA CCACGGCGCT ACAAGGCATG
CGCAGCAACC TGAATATGGA ATCTTCCCGC TGGGCGCGGG CCAATCGACG TTTAAAAGCG
ATCAATACGC TATCGCTGAC GCTGATTACC CAATCCTGCG AAACTTATCT TATTCAGAAT
ACGCGCCCGG AATCGATCAC TAATACTTTC CGCGAATTTT TTGACACGCC GGTAGAAACC
GCGCAGGACG TCCACAAGCA ACTCAAACGC CTGCGAAGAG TTATCGCCTG GACCGGGGAA
CGGGAAACGC CTGTCACCAT TTATAGCTGG GTCGCGGCGG CAACGCGTTA TCAGCTTCTC
AAGCGCGGCG TTATCAGTAA CACAAAAATC AACGCCACCG AAGAAGAGAT CCTGCAAGGC
GAACCGGAAG TAAAAGTAGA GTCAGCCGAA CGTCATCATG CAATGGTTAA CTTCTGGCGA
ACCACACTTT CCTGCATTCT GGGCACGCTT TTCTGGCTGT GGACGGGCTG GACTTCAGGC
AGTGGTGCAA TGGTGATGAT TGCGGTAGTG ACGTCACTGG CAATGCGTTT GCCGAATCCA
CGCATGGTGG CGATCGACTT TATCTACGGG ACGCTGGCCG CGCTGCCGTT AGGGCTGCTC
TACTTTTTGG TGATTATCCC TAATACCCAA CAGAGCATGT TGCTGCTGTG TATTAGCCTG
GCAGTGCTGG GATTTTTCCT CGGTATAGAA GTACAGAAAC GGCGGTTGGG CTCGATGGGG
GCACTGGCCA GCACCATAAA TATTATCGTG CTGGATAACC CGATGACTTT CCATTTCAGT
CAGTTTCTCG ACAGCGCATT AGGGCAAATC GTCGGCTGTG TGCTCGCGTT CACCGTTATT
TTGCTGGTGC GGGATAAATC GCGCGACAGG ACTGGACGTG TACTGCTTAA TCAGTTTGTT
TCTGCTGCTG TTTCCGCGAT GACTACCAAT GTGGCACGTC GTAAAGAGAA CCACCTCCCG
GCACTTTATC AGCAGCTGTT TTTGCTGATG AATAAGTTCC CAGGGGATTT GCCGAAATTT
CGCCTGGCGC TGACGATGAT TATCGCGCAC CAGCGCCTGC GTGATGCGCC GATCCCGGTT
AACGAGGATT TATCGGCGTT TCACCGACAA ATGCGCCGCA CAGCAGACCA TGTAATATCT
GCCCGTAGCG ATGATAAACG TCGTCGGTAC TTTGGCCAGT TGCTGGAAGA ACTTGAAATC
TACCAGGAAA AGCTACGCAT CTGGCAAGCG CCACCGCAGG TGACGGAACC GGTTCATCGG
CTGGCGGGGA TGCTCCTTAA GTATCAACAT GCGTTGACCG ATAGTTAA
 
Protein sequence
MGIFSIANQH IRFAVKLATA IVLALFVGFH FQLETPRWAV LTAAIVAAGP AFAAGGEPYS 
GAIRYRGFLR IIGTFIGCIA GLVIIIAMIR APLLMILVCC IWAGFCTWIS SLVRIENSYA
WGLAGYTALI IVITIQPEPL LTPQFAVERC SEIVIGIVCA IMADLLFSPR SIKQEVDREL
ESLLVAQYQL MQLCIKHGDG EVVDKAWGDL VRRTTALQGM RSNLNMESSR WARANRRLKA
INTLSLTLIT QSCETYLIQN TRPESITNTF REFFDTPVET AQDVHKQLKR LRRVIAWTGE
RETPVTIYSW VAAATRYQLL KRGVISNTKI NATEEEILQG EPEVKVESAE RHHAMVNFWR
TTLSCILGTL FWLWTGWTSG SGAMVMIAVV TSLAMRLPNP RMVAIDFIYG TLAALPLGLL
YFLVIIPNTQ QSMLLLCISL AVLGFFLGIE VQKRRLGSMG ALASTINIIV LDNPMTFHFS
QFLDSALGQI VGCVLAFTVI LLVRDKSRDR TGRVLLNQFV SAAVSAMTTN VARRKENHLP
ALYQQLFLLM NKFPGDLPKF RLALTMIIAH QRLRDAPIPV NEDLSAFHRQ MRRTADHVIS
ARSDDKRRRY FGQLLEELEI YQEKLRIWQA PPQVTEPVHR LAGMLLKYQH ALTDS