Gene EcolC_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1237 
Symbol 
ID6067412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1356303 
End bp1357508 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content54% 
IMG OID641600652 
Productintegrase family protein 
Protein accessionYP_001724230 
Protein GI170019276 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.184026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCAAGA CACCACTGAC AGCAAAGGCC ATAGATGCCG CACAACCACA GGACAAGCCC 
TACAAACTCA CAGATTCACA AACGCCAGGC CTTTTCTTGC TGGTCCATCC CAACGGTAGT
AAGTACTGGC GATTCCGGTA CTGGATAGAT AAGAAAGAGC GATTACAGGC CGTCGGGGTA
TATCCGCTGA TTAGCCTCAA GGAAGCCCGC AAACGCGCCA CAGAGAGCCG TTTACTGATA
GCCCAGGGAA TTGACCCAAT GGAAGAAGCG CGCAAGGAGA AAGCCATTGA TGCGCTCAAC
ATGGCGGCAA GTTTTAAGAC CGTGGCGGAG GACTGGCTTG CTACCAGGGT TAGCGGTTGG
TCAGAGTCCT ACACGAAACA GGTCAGATCG GCACTGGAGA AAGACGTTTA TCCGGTACTT
GGCAAGCGTT CAATCGTCGA TATAACGGCC CGTGATGTTC TGTCATTGCT TCAGAAGAAA
GAGCGCACCG CACCGGAACA AGCCAGGAAG CTACGCCAGC GTATCGGGGA GATCTTCAAA
TTTGCCGTTA TCACCGAACT GGTTAACCGG AATCCGGTTG CAGATCTGGA TACGGCATTG
AAAGCCAGAC GCCCAGGCCA TAACGCATGG ATACCGATTA GTGAAATTCC GGCATTCTAC
AAAGCCCTTG AGAGAGCCGG GAGCGTCCAG ATTCAGACGG CAATACGTTT GCTTATCCTC
ACGGCTTTGA GGACGGCAGA GCTTCGTTTA ATGCGCTGGG AGTGGGTGGA TCTGGAGTCG
GCAACAATCA CCCTACCCGC TGAAGTCATG AAGGCCCGCC GACCGCATGT AGTCCCGTTA
TCCCGGCAAG CGGTCGAGCT ATTACAGGAC CAGTTTACCC GCAGCGGATA CAGTGCTTTC
GTCTTTCCGG GCCGATTCAT GGATAAGCCA TTGTCAGCCA GTGCGATCCT TAAAGCCCTG
GAGCGTATCG GGTACAAGTC GATCGCCACT GGTCATGGCT GGAGGACAAC GTTCAGCACA
TCACTTAACG AAAGCGGCAG ATACAATCCC GACTGGATCG AAATCCAACT GGCCCACGTT
CCGAAAGGTG TGCGCGGCGT TTATAACCAG GCGGCCTATC TGAAGCAACG GCGGGCCATG
ATGCAGAACT ACAGCGACGC CATCGACCAG ATATTGGCTG GTGACGGTAA TCCACTTGAA
CCGTGA
 
Protein sequence
MSKTPLTAKA IDAAQPQDKP YKLTDSQTPG LFLLVHPNGS KYWRFRYWID KKERLQAVGV 
YPLISLKEAR KRATESRLLI AQGIDPMEEA RKEKAIDALN MAASFKTVAE DWLATRVSGW
SESYTKQVRS ALEKDVYPVL GKRSIVDITA RDVLSLLQKK ERTAPEQARK LRQRIGEIFK
FAVITELVNR NPVADLDTAL KARRPGHNAW IPISEIPAFY KALERAGSVQ IQTAIRLLIL
TALRTAELRL MRWEWVDLES ATITLPAEVM KARRPHVVPL SRQAVELLQD QFTRSGYSAF
VFPGRFMDKP LSASAILKAL ERIGYKSIAT GHGWRTTFST SLNESGRYNP DWIEIQLAHV
PKGVRGVYNQ AAYLKQRRAM MQNYSDAIDQ ILAGDGNPLE P