Gene B21_00789 details

Gene Information

Locus tag: B21_00789
Symbol: fiu
ID: 8112739
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 822653
End bp: 824935
Gene Length: 2283 bp
Protein Length: 760 aa
Translation table: 11
GC content: 54%
IMG OID: 644847055
Product: hypothetical protein
Protein accession: YP_002998628
Protein GI: 251784324
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4774] Outer membrane receptor for monomeric catechols
TIGRFAM ID: [TIGR01783] TonB-dependent siderophore receptor
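
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for B21_00789.
length = gene_length_bp(822653, 824935)
print(length)                     # 2283
print(protein_length_aa(length))  # 760
```

Both results agree with the Gene Length and Protein Length fields of the record.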


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAACA ATCGCAATTT CCCTGCCAGA CAATTTCATT CGCTCACGTT CTTTGCCGGT 
CTTTGTATTG GTATTACGCC TGTGGCACAG GCACTCGCCG CCGAAGGGCA AGCTAACGCG
GATGACACGC TGGTTGTCGA AGCATCAACG CCTTCGCTTT ATGCGCCACA ACAATCTGCC
GATCCGAAAT TCTCGCGTCC GGTAGCGGAT ACTACCCGCA CGATGACGGT GATTTCTGAA
CAAGTGATTA AAGATCAGGG CGCAACCAAC CTTACCGACG CGCTCAAAAA CGTCCCCGGC
GTGGGTGCGT TTTTTGCGGG TGAGAACGGT AACTCCACCA CTGGCGACGC CATTTATATG
CGTGGTGCCG ATACCTCTAA CAGCATTTAT ATTGATGGCA TTCGCGATAT CGGCAGCGTC
TCGCGCGACA CCTTCAATAC CGAGCAGGTC GAAGTGATTA AAGGGCCGTC CGGCACCGAC
TACGGGCGCA GCGCACCGAC GGGCTCGATC AATATGATCA GCAAACAGCC GCGCAATGAT
TCCGGCATTG ACGCCTCCGC CAGTATTGGC AGCGCCTGGT TCCGCCGCGG CACGCTGGAC
GTCAATCAGG TCATTGGTGA TACTACCGCG GTGCGCCTGA ATGTGATGGG CGAAAAAACG
CACGATGCCG GACGCGACAA AGTCAAAAAT GAGCGTTACG GCGTCGCCCC TTCTGTCGCT
TTTGGCCTTG GTACAGCGAA TCGTTTGTAT CTTAATTATC TGCATGTCAC TCAGCACAAC
ACGCCAGACG GCGGCATTCC GACCATCGGT TTGCCGGGCT ATTCTGCCCC ATCTGCGGGA
ACGGCGGCCC TGAATCATTC CGGAAAAGTT GATACTCATA ACTTTTACGG CACGGATTCC
GATTACGACG ATTCGACCAC CGACACCGCC ACCATGCGTT TTGAGCACGA CATCAACGAT
AACACCACCA TTCGCAATAC TACCCGTTGG TCGCGCGTAA AGCAGGATTA CCTGATGACG
GCGATTATGG GCGGGGCGTC GAATATTACT CAACCCACCA GCGATGTGAA TAGCTGGACC
TGGTCACGCA CGGCGAATAC CAAAGATGTG AGTAATAAAA TTCTCACCAA CCAGACCAAC
CTGACCTCGA CGTTCTATAC CGGTTCTATC GGTCATGATG TCAGTACCGG CGTGGAATTT
ACCCGTGAAA CGCAGACTAA CTACGGCGTT AATCCGGTGA CGTTACCTGC GGTAAATATT
TATCATCCTG ACAGCAGCAT TCATCCCGGC GGCCTGACGC GCAACGGCGC AAACGCCAAT
GGTCAGACGG ATACCTTCGC AATTTACGCC TTTGATACGC TGCAAATCAC CCGTGATTTT
GAGCTGAACG GCGGGATCCG TCTGGATAAT TATCATACTG AATATGACAG TGCCACCGCC
TGCGGCGGCA GCGGACGCGG TGCCATCACC TGCCCAGTTG GTGTGGCAAA AGGTTCTCCG
GTCACTACCG TCGACACCGC CAGGTCGGGC AATCTGGTGA ACTGGAAAGC CGGGGCGCTG
TATCACCTGA CGGAAAACGG CAATGTCTAT ATTAACTATG CCGTTTCCCA GCAGCCTCCA
GGCGGCAACA ACTTCGCCCT TGCGCAGTCT GGCAGCGGTA ACAGTGCCAA CCGCACCGAT
TTTAAACCGC AAAAAGCCAA CACCAGCGAG ATTGGCACCA AATGGCAGGT TCTGGATAAA
CGCCTGTTGC TCACCGCCGC GCTGTTCCGT ACTGATATCG AAAATGAAGT TGAGCAAAAT
GATGACGGGA CTTACTCGCA ATACGGTAAG AAACGCGTCG AAGGCTATGA GATATCCGTG
GCCGGGAATA TCACTCCCGC GTGGCAGGTG ATTGGCGGCT ATACCCAGCA AAAAGCAACC
ATCAAAAACG GCAAAGATGT TGCCCAGGAT GGTTCCTCAT CGCTGCCGTA TACCCCGGAG
CACGCCTTCA CCTTATGGAG CCAATATCAG GCAACCGACG ATATCTCTGT TGGCGCGGGC
GCACGCTATA TCGGCAGTAT GCATAAAGGT TCAGACGGCG CGGTGGGAAC GCCAGCGTTT
ACCGAAGGTT ACTGGGTCGC CGATGCCAAA CTGGGGTATC GAGTTAATCG CAATCTCGAC
TTCCAGCTAA ACGTTTACAA CCTGTTTGAT ACCGATTACG TCGCCTCAAT CAACAAGAGC
GGCTACCGTT ATCACCCGGG CGAGCCAAGA ACCTTCTTGC TCACAGCCAA TATGCATTTC
TGA
 
Protein sequence
MENNRNFPAR QFHSLTFFAG LCIGITPVAQ ALAAEGQANA DDTLVVEAST PSLYAPQQSA 
DPKFSRPVAD TTRTMTVISE QVIKDQGATN LTDALKNVPG VGAFFAGENG NSTTGDAIYM
RGADTSNSIY IDGIRDIGSV SRDTFNTEQV EVIKGPSGTD YGRSAPTGSI NMISKQPRND
SGIDASASIG SAWFRRGTLD VNQVIGDTTA VRLNVMGEKT HDAGRDKVKN ERYGVAPSVA
FGLGTANRLY LNYLHVTQHN TPDGGIPTIG LPGYSAPSAG TAALNHSGKV DTHNFYGTDS
DYDDSTTDTA TMRFEHDIND NTTIRNTTRW SRVKQDYLMT AIMGGASNIT QPTSDVNSWT
WSRTANTKDV SNKILTNQTN LTSTFYTGSI GHDVSTGVEF TRETQTNYGV NPVTLPAVNI
YHPDSSIHPG GLTRNGANAN GQTDTFAIYA FDTLQITRDF ELNGGIRLDN YHTEYDSATA
CGGSGRGAIT CPVGVAKGSP VTTVDTARSG NLVNWKAGAL YHLTENGNVY INYAVSQQPP
GGNNFALAQS GSGNSANRTD FKPQKANTSE IGTKWQVLDK RLLLTAALFR TDIENEVEQN
DDGTYSQYGK KRVEGYEISV AGNITPAWQV IGGYTQQKAT IKNGKDVAQD GSSSLPYTPE
HAFTLWSQYQ ATDDISVGAG ARYIGSMHKG SDGAVGTPAF TEGYWVADAK LGYRVNRNLD
FQLNVYNLFD TDYVASINKS GYRYHPGEPR TFLLTANMHF
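
The relationship between the two sequences can be checked by translating the gene sequence and comparing the result with the protein sequence. The following is a minimal sketch, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        residue = CODON_TABLE.get(s[pos:pos + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGAAAACA ATCGCAATTT CCCTGCCAGA"))  # MENNRNFPAR
```

Passing the full gene sequence to `translate` and `gc_content` allows the Protein sequence and GC content fields to be compared against the nucleotide data in the same way.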