Gene ECD_03043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03043 
SymbolhflB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3193500 
End bp3195434 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotease, ATP-dependent zinc-metallo 
Protein accessionACT44847 
Protein GI253979177 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000537455 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAA ACCTAATACT CTGGCTGGTC ATTGCCGTTG TGCTGATGTC AGTATTCCAG 
AGCTTTGGGC CCAGCGAGTC TAATGGCCGT AAGGTGGATT ACTCTACCTT CCTACAAGAG
GTCAATAACG ACCAGGTTCG TGAAGCGCGT ATCAACGGAC GTGAAATCAA CGTTACCAAG
AAAGATAGTA ACCGTTATAC CACTTACATT CCGGTTCAGG ATCCGAAATT ACTGGATAAC
CTGTTGACCA AGAACGTCAA GGTTGTCGGT GAACCGCCTG AAGAACCAAG CCTGCTGGCT
TCTATCTTCA TCTCCTGGTT CCCGATGCTG TTGCTGATTG GTGTCTGGAT CTTCTTCATG
CGTCAAATGC AGGGCGGCGG TGGCAAAGGT GCCATGTCGT TTGGTAAGAG CAAAGCGCGC
ATGCTGACGG AAGATCAGAT CAAAACGACC TTTGCTGACG TTGCGGGCTG CGACGAAGCA
AAAGAAGAAG TTGCTGAACT GGTAGAGTAT CTGCGCGAGC CGAGCCGCTT CCAGAAACTC
GGCGGTAAGA TCCCGAAAGG CGTCCTGATG GTCGGTCCTC CGGGTACCGG TAAAACGTTG
CTGGCGAAAG CGATTGCAGG TGAAGCGAAA GTTCCGTTCT TTACTATCTC CGGTTCTGAC
TTCGTAGAAA TGTTCGTCGG TGTGGGTGCA TCCCGTGTTC GTGACATGTT CGAACAGGCG
AAGAAAGCGG CACCGTGCAT CATCTTTATC GATGAAATCG ACGCCGTAGG CCGCCAGCGT
GGCGCTGGTC TGGGCGGTGG TCACGATGAA CGTGAACAGA CTCTGAACCA GATGCTGGTT
GAGATGGATG GCTTCGAAGG TAACGAAGGT ATCATCGTTA TCGCCGCGAC TAACCGTCCG
GACGTTCTCG ACCCGGCCCT GCTGCGTCCT GGCCGTTTCG ACCGTCAGGT TGTGGTCGGC
TTGCCAGATG TTCGCGGTCG TGAGCAGATC CTGAAAGTTC ACATGCGTCG CGTACCATTG
GCCCCCGATA TCGACGCGGC AATCATTGCC CGTGGTACTC CTGGTTTCTC CGGTGCTGAC
CTGGCGAACC TGGTGAACGA AGCGGCACTG TTCGCTGCTC GTGGCAACAA ACGCGTTGTG
TCGATGGTTG AGTTCGAGAA AGCGAAAGAC AAAATCATGA TGGGTGCGGA ACGTCGCTCC
ATGGTGATGA CGGAAGCGCA GAAAGAATCA ACGGCTTACC ACGAAGCGGG TCATGCGATT
ATCGGTCGCC TGGTGCCGGA ACACGATCCG GTGCACAAAG TGACGATTAT CCCACGCGGT
CGTGCGCTGG GTGTGACTTT CTTCTTGCCT GAGGGCGACG CAATCAGCGC CAGCCGTCAG
AAACTGGAAA GCCAGATTTC TACGCTGTAC GGTGGTCGTC TGGCAGAAGA GATCATCTAT
GGGCCGGAAC ATGTTTCTAC CGGTGCGTCC AACGATATTA AAGTTGCGAC CAATCTGGCA
CGTAACATGG TGACCCAGTG GGGCTTCTCT GAGAAATTGG GTCCACTGCT GTACGCGGAA
GAAGAAGGTG AAGTATTCCT CGGCCGTAGC GTAGCGAAAG CGAAACATAT GTCCGATGAA
ACTGCACGTA TCATCGACCA GGAAGTGAAA GCACTGATTG AGCGTAACTA TAATCGTGCG
CGTCAGCTTC TGACCGACAA TATGGATATT CTGCATGCGA TGAAAGATGC TCTCATGAAA
TATGAGACTA TCGACGCACC GCAGATTGAT GACCTGATGG CACGTCGCGA TGTACGTCCG
CCAGCGGGCT GGGAAGAACC AGGCGCTTCT AACAATTCTG GCGACAATGG TAGTCCAAAG
GCTCCTCGTC CGGTTGATGA ACCGCGTACG CCGAACCCGG GTAACACCAT GTCAGAGCAG
TTAGGCGACA AGTAA
 
Protein sequence
MAKNLILWLV IAVVLMSVFQ SFGPSESNGR KVDYSTFLQE VNNDQVREAR INGREINVTK 
KDSNRYTTYI PVQDPKLLDN LLTKNVKVVG EPPEEPSLLA SIFISWFPML LLIGVWIFFM
RQMQGGGGKG AMSFGKSKAR MLTEDQIKTT FADVAGCDEA KEEVAELVEY LREPSRFQKL
GGKIPKGVLM VGPPGTGKTL LAKAIAGEAK VPFFTISGSD FVEMFVGVGA SRVRDMFEQA
KKAAPCIIFI DEIDAVGRQR GAGLGGGHDE REQTLNQMLV EMDGFEGNEG IIVIAATNRP
DVLDPALLRP GRFDRQVVVG LPDVRGREQI LKVHMRRVPL APDIDAAIIA RGTPGFSGAD
LANLVNEAAL FAARGNKRVV SMVEFEKAKD KIMMGAERRS MVMTEAQKES TAYHEAGHAI
IGRLVPEHDP VHKVTIIPRG RALGVTFFLP EGDAISASRQ KLESQISTLY GGRLAEEIIY
GPEHVSTGAS NDIKVATNLA RNMVTQWGFS EKLGPLLYAE EEGEVFLGRS VAKAKHMSDE
TARIIDQEVK ALIERNYNRA RQLLTDNMDI LHAMKDALMK YETIDAPQID DLMARRDVRP
PAGWEEPGAS NNSGDNGSPK APRPVDEPRT PNPGNTMSEQ LGDK