Gene B21_02801 details


Gene Information

Locus tag: B21_02801
Symbol: ybl132
ID: 8116422
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2990005
End bp: 2991726
Gene Length: 1722 bp
Protein Length: 573 aa
Translation table: 11
GC content: 53%
IMG OID: 644848990
Product: hypothetical protein
Protein accession: YP_003000563
Protein GI: 251786259
COG category: [I] Lipid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
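
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS: codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2990005, 2991726)
    print(length)                     # 1722
    print(protein_length_aa(length))  # 573
```

Both results agree with the Gene Length and Protein Length values listed in the record.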


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000478815
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAATA AAATCTTTAC GCATTCCCTA CCTATGCGCT ATGCCGATTT TCCAACGCTT 
GTTGATGCTT TGGACTACGC CGCTCTGAGT AGCGCCGGAA TGAATTTTTA TGACAGACGT
TGCCAACTTG AAGATCAACT GGAATATCAG ACGTTAAAAG CACGTGCCGA AGCTGGTGCG
AAGCGGTTGT TATCGCTGAA CCTGAAAAAA GGCGATCGCG TGGCACTGAT TGCCGAAACA
AGTAGCGAGT TCGTAGAGGC TTTTTTTGCC TGCCAGTATG CCGGCTTAGT CGCCGTCCCG
TTGGCGATTC CAATGGGCGT TGGTCAGCGG GATTCCTGGA GCGCCAAATT GCAGGGTTTA
CTGGCAAGTT GCCAGCCCGC AGCCATTATC ACTGGTGATG AGTGGTTGCC ACTGGTCAAT
GCCGCGACGC ATGACAACCC CGAATTACAT GTTTTAAGCC ACGCTTGGTT TAAGGCATTA
TCGGAAGCCG ATGTTGCGCT CCAGCGTCCA GTTCCGAACG ATATCGCCTA CCTCCAGTAC
ACCTCCGGCA GCACCCGTTT TCCCCGTGGC GTCATTATCA CCCATCGCGA AGTGATGGCT
AATCTACGTG CTATAAGCCA CGACGGCATT AAATTACGCC CTGGCGACCG CTGCGCCTCC
TGGCTGCCTT TCTACCATGA TATGGGACTG GTCGGCTTTC TCCTGACCCC CGTCGCCACG
CAGCTTTCAG TAGATTATTT GCGCACTCAG GATTTTGCCA TGCGTCCTCT GCAATGGCTT
AAATTGATCA GTAAAAATCG CGGCACCGTT TCCGTTGCGC CGCCGTTTGG CTATGAATTG
TGCCAGCGCC GCGTGAATGA AAAAGATCTC GCTGAACTGG ATCTTTCCTG CTGGCGCGTC
GCTGGTATTG GTGCAGAACC CATCTCCGCA GAACAACTCC ATCAATTCGC TGAATGTTTC
CGTCAGGTTA ACTTTGACAA TAAAACTTTC ATGCCGTGCT ACGGACTGGC AGAAAATGCG
CTGGCTGTCA GCTTCTCTGA TGAAGCCTCC GGGGTTGTGG TTAACGAAGT GGATCGCGAC
ATCCTCGAAT ATCAGGGTAA AGCCGTCGCG CCGGGTGCAG AGACACGCGC CGTATCGACT
TTCGTCAACT GCGGCAAAGC GTTGCCGGAA CATGGTATTG AAATCCGCAA TGAAGCAGGT
ATGCCGGTCG CGGAACGTGT GGTAGGCCAT ATTTGCATCT CCGGTCCCAG TCTGATGAGC
GGTTACTTTG GCGACCAGGT TTCGCAAGAC GAGATTGCCG CGACGGGCTG GTTAGACACC
GGCGACCTCG GTTATCTGCT GGACGGTTAT CTGTATGTCA CCGGACGCAT TAAAGATCTG
ATTATTATTC GTGGCCGTAA TATCTGGCCG CAGGATATTG AATATATAGC GGAACAGGAA
CCGGAAATTC ATTCTGGCGA TGCGATTGCT TTTGTTACCG CCCAGGAAAA AATCATTTTG
CAGATCCAGT GTCGGATCAG CGACGAAGAA CGTCGCGGGC AGCTTATCCA CGCGCTGGCG
GCACGGATCC AAAGCGAATT TGGCGTGACC GCGGCTATCG CTCTGTTGCC GCCCCACAGT
ATTCCCCGAA CGTCCTCCGG CAAGCCTGCC CGTGCGGAAG CGAAAAAACG TTATCAGAAG
GCTTATGCTG CCAGTCTTAA TGTGCAGGAA TCCCTGGCAT GA
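
The record reports a GC content of 53%. As a rough sketch of how such a figure can be derived from a sequence printed in 10-base blocks like the one above, the helper below strips whitespace and counts G and C bases. The function names are hypothetical, and this is not necessarily how the database computed its value.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for 10-base block layout."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


if __name__ == "__main__":
    # First block of the gene sequence: 1 G and 1 C in 10 bases.
    print(gc_fraction("ATGTCTAATA"))  # 0.2
```

Passing the full gene sequence text to `gc_fraction` and rounding to a percentage gives a value that can be compared with the GC content field.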
 
Protein sequence
MSNKIFTHSL PMRYADFPTL VDALDYAALS SAGMNFYDRR CQLEDQLEYQ TLKARAEAGA 
KRLLSLNLKK GDRVALIAET SSEFVEAFFA CQYAGLVAVP LAIPMGVGQR DSWSAKLQGL
LASCQPAAII TGDEWLPLVN AATHDNPELH VLSHAWFKAL SEADVALQRP VPNDIAYLQY
TSGSTRFPRG VIITHREVMA NLRAISHDGI KLRPGDRCAS WLPFYHDMGL VGFLLTPVAT
QLSVDYLRTQ DFAMRPLQWL KLISKNRGTV SVAPPFGYEL CQRRVNEKDL AELDLSCWRV
AGIGAEPISA EQLHQFAECF RQVNFDNKTF MPCYGLAENA LAVSFSDEAS GVVVNEVDRD
ILEYQGKAVA PGAETRAVST FVNCGKALPE HGIEIRNEAG MPVAERVVGH ICISGPSLMS
GYFGDQVSQD EIAATGWLDT GDLGYLLDGY LYVTGRIKDL IIIRGRNIWP QDIEYIAEQE
PEIHSGDAIA FVTAQEKIIL QIQCRISDEE RRGQLIHALA ARIQSEFGVT AAIALLPPHS
IPRTSSGKPA RAEAKKRYQK AYAASLNVQE SLA
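
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation follows, assuming the sense strand is given 5' to 3' as printed. Table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the start codons it permits, so the sketch builds the standard assignments. It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence in this record.
    print(translate("ATGTCTAATA AAATCTTTAC GCATTCCCTA"))  # MSNKIFTHSL
```

The ten residues produced from the first 30 bases match the start of the protein sequence listed above.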