Gene EcolC_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1063 
Symbol 
ID6066036 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1155680 
End bp1156876 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content45% 
IMG OID641600475 
Productintegrase family protein 
Protein accessionYP_001724057 
Protein GI170019103 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCAAGAA CGACACGCCC CCTGACCAAC ACAGAAGTAC TGCGCGCTAA AGCGTTAGAA 
AAGGATCTAA CGTTGCATGA TGGCGATGGT CTTTTTCTAC TCGTTAAAAC GAACGGTAAG
AAGTTATGGC GTTTCCGTTA TCAACGTCCG GCAACAAAGC AACGGACAAT GATGGGGCTA
GGAGCCTTCC CAGCCCTTTC ACTTGCTGAC GCCCGACGCT TAAGAGCGGA TTACCTTTCC
TTGTTAGCCA ACGGAATTGA CCCGCAAATT CAAGCTGAAA TTGCAGAGGA ACAGCAGCAA
ATCGCACAGG ACAGTATTTT CTCGACGGTC GCCGCTAATT GGTTTCAGCT CAAAAGCAAA
AGTGTTACCC CTGATTATGC AAAAGATATT TGGCGCTCAT TGGAAAAAGA TGTATTCCCC
GCCGTTGGTG AGATGCCCGT TCAGCAGATC AAAGCTAGAA CATTGGTCGA AGCACTTGAG
CCAGTCAAAG CTCGTGGGGC ATTAGAGACT GTACGTCGTC TGGTGCAACG CATTAACGAA
ATAATGATTT ATGCGGTTAA CACTGGCTTG ATTGATGCAA ACCCAGCATC AGGTGTTGGC
ATGGCCTTCG AAAAGCCAAA AAAACAAAAC ATGCCGACGC TTCGACCTGA AGAATTACCA
AAGCTGATGC GTTCTTTAGT CATGTCAAAT CTGTCTATCC CGACTCGCTG TCTAATTGAA
TGGCAACTCC TGACTCTTGT GCGCCCTTCT GAAGCCTCCA GTACTCGGTG GGAAGAAATC
GATCTTCATG CAAAGCTCTG GACGATTCCT GCCGAACGGA TGAAGGCTAA ACGGGAACAC
ATAATTCCTC TATCATCTCA GGCATTAGAG ATTCTTAATG TGATGAAGCC TATTAGTGCT
CATCGTGAAT ATGTTTTTCC GAGTCGGAAT GACCCAAAGA AACCAATGAA CAGTCAGACT
GCAAATGCAG CTTTAAAACG TATTGGTTTT GGCGGAAAAT TAGTTGCCCA TGGATTACGT
TCAATAGCAA GTACAGCCAT GAATGAAGCT GGATTAAATC CTGATGTTAT CGAGTCTGCC
TTAGCCCACA GTGATAAAAA TGAAGTTAGA AAAGCATACA ATCGTTCTAC TTATCTCGTG
CAGCGAATTG AATTGATGGA TTGGTGGGGA GAATACGTTA AAAATAAAAG GGGTTAA
 
Protein sequence
MARTTRPLTN TEVLRAKALE KDLTLHDGDG LFLLVKTNGK KLWRFRYQRP ATKQRTMMGL 
GAFPALSLAD ARRLRADYLS LLANGIDPQI QAEIAEEQQQ IAQDSIFSTV AANWFQLKSK
SVTPDYAKDI WRSLEKDVFP AVGEMPVQQI KARTLVEALE PVKARGALET VRRLVQRINE
IMIYAVNTGL IDANPASGVG MAFEKPKKQN MPTLRPEELP KLMRSLVMSN LSIPTRCLIE
WQLLTLVRPS EASSTRWEEI DLHAKLWTIP AERMKAKREH IIPLSSQALE ILNVMKPISA
HREYVFPSRN DPKKPMNSQT ANAALKRIGF GGKLVAHGLR SIASTAMNEA GLNPDVIESA
LAHSDKNEVR KAYNRSTYLV QRIELMDWWG EYVKNKRG