Gene ECD_03521 details


Gene Information

Locus tag: ECD_03521
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 3709771
End bp: 3712158
Gene Length: 2388 bp
Protein Length: 795 aa
Translation table: 11
GC content: 43%
IMG OID:
Product: conserved hypothetical protein
Protein accession: ACT45319
Protein GI: 253979649
COG category:
COG ID:
TIGRFAM ID:
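
The coordinates and lengths listed above agree with one another, which the following minimal Python sketch checks. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3709771
end_bp = 3712158

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length)     # 2388
print(remainder)       # 0
print(protein_length)  # 795
```

Under these assumptions the computed values match the reported Gene Length (2388 bp) and Protein Length (795 aa).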


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGATGA AAACATTAAA AAACTGGAAA CTACAAAATC AGTCAGCCCA TCATATAGAG 
CTATTAGTTG ATGGTCAGCA TTCTCTTTGC CTGTATATAC TCGAAGAAAA TATGTTCCGG
GTTCTATTAA AACGGAAGGG GGTTTTGTCG CTGGACCGAA CCTGGAGCAT TGCTCCGGAA
AAGGATGTCC CGTGGGAAGG GCGTCATCGC GAAGATATTA GCGGTTTTTC ACTGCCGACC
TGGAACATGG AGCAGAATGA TGAATTACTG ACAATCACGA CCAGTTCATT ACGCGTAATT
ATTCACAAGC CTTTATGGCT TGAATGGCAT TATAAGGATA ATGCTGGTCA GTGGCAGGAA
CTTGTTAATG ATCGCCCTAC CAGTGCTTAT CTGATAAACG CTCATGGTGA TGGTGTTGCA
CACTATCAAA GTCGACGTAA TGACGAGCGT TTTTATGGAC TTGGTGATAA ATCCGGGGAT
CTGCAACGCA CAGGAAAACG CTATGAAATG CGGAACCTGG ACGCAATGGG ATATAATGCA
GTAAGTACAG ACCCTCTTTA TAAACATATT CCATTTACAA TTACTCACCG TAGCGATATT
AGCTTTGGAT TATTTTATGA TAACCTCAGC AATAGTTGGC TGGATTTAGG TAATGAAATA
GATAATTATC ATACAACTTA CCGCCGTTGG CAGGCTGAGG CGGGAGATAT TGATTATTAT
CTTTTCACTG GCAGATGTGT ACTTGATATC ACCAAAGCCT TTGTTCGTTT GACGGGGAAA
ACACTCTTCG GTCCAAAATG GAGTCTTGGG TACAGCGGTT CCACGATGCA CTATACAGAT
GCACCGGATG CTCAGAATCA ATTGATGCAG TTTATTCGTC TTTGTAAGGA ACATGCTATT
CCATGTGATT CTTTCCAGCT ATCTTCTGGT TATACTTCTA TTAATGGTAA ACGCTACGTA
TTCAACTGGA ATTACGACAA AGTTCCACAC CCTAAAATGA TGAGTCAGGA GTTTCATAAT
GCAGGGATAC ACCTGGCCGC TAATATAAAA CCATGTTTAT TGCAGGATCA TCCTCGCTAT
AACGAAGTAG CAGAACAAGA GCTCTTTATT CGTGACTCTG AATACAACGT CCCGGAACGC
TCCAGCTTCT GGGATGATGA GGGTTCTCAT CTGGACTTTA CGAACCCACA AACAGTCGCA
TGGTGGCAGG AAGGTGTAAC CACACAACTA CTTGAAATGG GAATTGATTC CACCTGGAAT
GATAACAATG AGTTTGAAGT ATGGGATGGG GAAGCGCGCT GTCATGGTTT TGGCAAAGAA
ATCGCCATTA AACACATTCG GCCAGTAATG CCTCTTTTAA TGTGCAGAGC ATCTATGGAA
GCACAGCAAA AATTTGCACC GAACAAACGA CCATATTTGA TTTCCCGCTC TGGATGCGCC
GGATTGCAGC GTTATGTTCA GACATGGAGT GGGGACAACC GAACTAACTG GGATACCCTT
CGATACAACA CCCGCATGGG GCTGGGAATG AGCCTCTCAG GATTGTATAA CATTGGTCAT
GATGTCGGTG GTTTTTCTGG TGATAAGCCT GATCCAGAAT TATTTGTTCG TTGGGTTCAG
AACGGTGTTA TGCACCCCCG ATTTACGATC CACTCGTGGA ATGATGATCA TACTGTCAAT
GAACCATGGA TGTATCCGGA AGTAACACCC GCGATTCGTA GTGCAATTGA GTTACGTTAT
CGTCTTATGC CTTATTTATA TACATTATTG TGGCAGGCAC ATGCCGATGA TGAGCCAATA
TTACGTCCAA CCTTCCTTGA TCATGAGCAC GATGTTCAGA CATTTGAAGA ATGTGATGAT
TTTATGCTGG GACGCGATAT TCTGGTTGCA AGTGTCGTGG AAGCAGGGCA ACGACAACGA
CGAGTATGGT TACCCGACAA CAAAACAGGA TGGTACGACT TTTACAATGG CGAATGGTTC
TGTGGTGGTC AATGGATAAC AATCGACGCG CCACTGGAAA AACTACCATT ATTAGTACGT
GCAGGTGCAG GTATTCCTCT TAGTGAACGT ATAACATACG TGAGTGAAGC GGAAGATAAT
CATCGTAAAT TGAAATTATT CCCAATTAAA GGAACGGGTA AATCCACCGG ACTTCTGTTT
GAAGATGATG GCGAAACTTG GGGATATACT GAAGGTAATG CTCTGTGGCT TGAATGGGAG
CTGGACTGCA CAGCCACTAC AATTGAGTTA AGAATAAACA CTCATGGAGA TTATCGCCCT
GCATGGGAAA CATTGAAAGT GATAATTCCT CAGGGAGAAA GTCGTCAACT ACTGATTAAC
GGCATTGAGG CTTATGAATG GAATATGAAC CTATCCTGTA ATGATTAA
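
A GC content figure such as the one reported in the Gene Information fields can be computed from a sequence written in this layout. The sketch below is one plausible way to do it, not necessarily how this record's value was derived. The function name `gc_fraction` is hypothetical, and the excerpt is only the first 60 bases of the gene, so its result is not expected to equal the whole-gene value.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 60 bases of the gene sequence, in the 10-base blocks used in this record.
excerpt = "GTGGAGATGA AAACATTAAA AAACTGGAAA CTACAAAATC AGTCAGCCCA TCATATAGAG"

print(f"{gc_fraction(excerpt):.0%}")  # 35%
```

Passing the full 2388-base gene sequence to the same function would give the whole-gene fraction.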
 
Protein sequence
MEMKTLKNWK LQNQSAHHIE LLVDGQHSLC LYILEENMFR VLLKRKGVLS LDRTWSIAPE 
KDVPWEGRHR EDISGFSLPT WNMEQNDELL TITTSSLRVI IHKPLWLEWH YKDNAGQWQE
LVNDRPTSAY LINAHGDGVA HYQSRRNDER FYGLGDKSGD LQRTGKRYEM RNLDAMGYNA
VSTDPLYKHI PFTITHRSDI SFGLFYDNLS NSWLDLGNEI DNYHTTYRRW QAEAGDIDYY
LFTGRCVLDI TKAFVRLTGK TLFGPKWSLG YSGSTMHYTD APDAQNQLMQ FIRLCKEHAI
PCDSFQLSSG YTSINGKRYV FNWNYDKVPH PKMMSQEFHN AGIHLAANIK PCLLQDHPRY
NEVAEQELFI RDSEYNVPER SSFWDDEGSH LDFTNPQTVA WWQEGVTTQL LEMGIDSTWN
DNNEFEVWDG EARCHGFGKE IAIKHIRPVM PLLMCRASME AQQKFAPNKR PYLISRSGCA
GLQRYVQTWS GDNRTNWDTL RYNTRMGLGM SLSGLYNIGH DVGGFSGDKP DPELFVRWVQ
NGVMHPRFTI HSWNDDHTVN EPWMYPEVTP AIRSAIELRY RLMPYLYTLL WQAHADDEPI
LRPTFLDHEH DVQTFEECDD FMLGRDILVA SVVEAGQRQR RVWLPDNKTG WYDFYNGEWF
CGGQWITIDA PLEKLPLLVR AGAGIPLSER ITYVSEAEDN HRKLKLFPIK GTGKSTGLLF
EDDGETWGYT EGNALWLEWE LDCTATTIEL RINTHGDYRP AWETLKVIIP QGESRQLLIN
GIEAYEWNMN LSCND
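
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below illustrates this on short excerpts. It assumes the usual convention that an initiating GTG is read as methionine, and it builds the codon table from the standard TCAG-ordered amino acid string. The function and variable names are illustrative.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these amino acid assignments
# with the standard code and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    usable = len(bases) - len(bases) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")  # assumption: any start codon initiates with Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
gene_prefix = "GTGGAGATGA AAACATTAAA AAACTGGAAA"
print(translate_cds(gene_prefix))  # MEMKTLKNWK
```

The output matches the first ten residues of the protein sequence. Because the first codon is GTG, a plain table lookup would give V rather than M.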