Gene EcolC_2889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2889 
Symbol 
ID6065335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3147081 
End bp3148151 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content45% 
IMG OID641602294 
Productlambda integrase 
Protein accessionYP_001725843 
Protein GI170020889 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00939202 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0727877 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGAA GGCGAAGTCA TGAGCGCCGG GATTTACCCC CTAACCTTTA TATAAGAAAC 
AATGGATATT ACTGCTACAG GGACCCAAGG ACGGGTAAAG AGTTTGGATT AGGCCGAGAC
AGGCGAATCG CAATCACTGA AGCTATACAG GCCAACATTG AGTTATTTTC AGGACACAAA
CACAAGCCTC TGACAGCGAG AATCAACAGT GATAATTCCG TTACGTTACA TTCATGGCTT
GATCGCTACG AAAAAATCCT GGCCAGCAGA GGAATCAAGC AGAAGACACT CATAAATTAC
ATGAGCAAAA TTAAAGCAAT AAGGAGGGGT CTGCCTGATG CTCCACTTGA AGACATCACC
ACAAAAGAAA TTGCGGCAAT GCTCAATGGA TACATAGACG AGGGAAAGGC GGCATCAGCC
AAGTTAATCA GATCAACACT GAGCGATGCA TTCCGAGAGG CTATGGCTGA AGGCCATATA
ACAACAAACC CGGTCGCAGC CACTCGCGCT GCAAAATCAG AGGTAAGGAG ATCAAGACTT
ACGGCTGACG AATACCTGAA AATTTATCAA GCAGCAGAAT CATCACCATG TTGGCTTAGA
CTTGCAATGG AACTGGCTGT TGTTACCGGG CAGCGAGTTG GTGATTTATG CGAAATGAAG
TGGTCTGATA TCGTAGATGG ATATCTTTAT GTCGAGCAAA GCAAAACAGG CGTAAAAATT
GCCATCCCAA CAACATTGCA TGTTGATGCT CTCGGGATAT CAATGAAGGA AACACTTGAT
AAATGCAAAA AGATTCTTGG CGGAGAAACC ATAATTGCAT CTACTCGTCG TGAACCGCTT
TCATCCGGCA CAGTATCAAG GTATTTTATG CGCGCACGAA AAGCATCAGG TCTCTCCTTC
GAAGGGGATC CGCCAACCTT TCACGAGTTG CGCAGTTTGT CTGCAAGACT CTATGAGAAG
CAGATAAGCG ATAAATTTGC TCAACATCTT CTCGGGCATA AGTCGGACAC CATGGCATCA
CAGTATCGTG ATGACAGAGG CAGGGAGTGG GACAAAATTG AAATCAAATA A
 
Protein sequence
MGRRRSHERR DLPPNLYIRN NGYYCYRDPR TGKEFGLGRD RRIAITEAIQ ANIELFSGHK 
HKPLTARINS DNSVTLHSWL DRYEKILASR GIKQKTLINY MSKIKAIRRG LPDAPLEDIT
TKEIAAMLNG YIDEGKAASA KLIRSTLSDA FREAMAEGHI TTNPVAATRA AKSEVRRSRL
TADEYLKIYQ AAESSPCWLR LAMELAVVTG QRVGDLCEMK WSDIVDGYLY VEQSKTGVKI
AIPTTLHVDA LGISMKETLD KCKKILGGET IIASTRREPL SSGTVSRYFM RARKASGLSF
EGDPPTFHEL RSLSARLYEK QISDKFAQHL LGHKSDTMAS QYRDDRGREW DKIEIK