Gene EcolC_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1463 
Symbol 
ID6067277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1612782 
End bp1614542 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content53% 
IMG OID641600883 
Producttype III restriction protein res subunit 
Protein accessionYP_001724453 
Protein GI170019499 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1061] DNA or RNA helicases of superfamily II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000557084 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.860353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTTA CACTTCGCCC ATATCAGCAA GAAGCCGTGG ATGCCACGCT CAACCATTTT 
CGTCGTCATA AAACCCCTGC CGTTATCGTG CTGCCCACCG GCGCAGGTAA AAGCCTGGTG
ATAGCGGAAC TGGCACGGCT GGCTCGTGGT CGCGTGCTGG TGCTGGCACA CGTTAAAGAA
CTGGTGGCGC AAAACCATGC AAAGTATCAG GCGCTGGGGC TGGAAGCCGA TATTTTTGCC
GCCGGGCTAA AGCGCAAAGA GAGCCACGGT AAAGTGGTAT TTGGCAGCGT GCAGTCTGTC
GCCCGTAATC TTGATGCCTT TCAGGGTGAA TTTTCGCTGT TGATTGTCGA TGAATGTCAC
CGTATTGGTG ACGATGAAGA GAGCCAGTAT CAGCAAATCC TCACTCACCT GACAAAAGTG
AATCCCCACT TACGCCTGCT GGGGCTGACT GCCACGCCTT TTCGATTGGG CAAAGGCTGG
ATCTACCAGT TTCATTATCA CGGCATGGTA CGCGGCGATG AGAAAGCCCT TTTCCGTGAC
TGCATTTATG AGCTGCCGCT GCGTTATATG ATTAAACACG GCTATCTGAC GCCGCCAGAA
CGACTGGATA TGCCAGTAGT GCAATACGAT TTCAGCCGCT TGCAGGCACA GAGTAACGGG
CTGTTCAGCG AAGCCGATCT CAACCGTGAG CTGAAAAAAC AACAACGTAT TACCCCGCAC
ATCATCAGCC AGATTATGGA GTTTGCTGCA ACGCGCAAAG GGGTGATGAT TTTTGCCGCG
ACGGTTGAAC ACGCAAAAGA GATTGTGGGA TTACTGCCTG CCGAAGATGC AGCACTGATT
ACTGGCGACA CCCCCGGCGC TGAGCGCGAT GTGTTAATTG AAAATTTTAA AGCCCAGCGT
TTTCGCTATC TGGTCAACGT CGCGGTACTG ACCACCGGAT TTGACGCCCC GCACGTCGAT
CTTATCGCCA TTCTGCGCCC TACCGAATCA GTGAGTCTTT ACCAACAAAT TGTCGGGCGC
GGTCTGCGTC TCGCTCCGGG CAAGACTGAT TGCTTAATTC TTGATTATGC GGGTAATCCT
CACGATCTCT ACGCGCCGGA AGTTGGTACA CCAAAAGGCA AAAGTGACAA CGTTCCGGTA
CAGGTTTTCT GCCCTGCCTG CGGTTTTGCC AACACCTTTT GGGGGAAAAC GACCGCCGAC
GGGACATTGA TTGAACACTT TGGTCGTCGC TGTCAGGGAT GGTTTGAAGA TGACGACGGT
CATCGCGAAC AATGTGACTT CCGTTTCCGT TTTAAAAATT GCCCGCAATG TAACGCGGAA
AACGATATTG CCGCCCGCCG CTGCCGCGAA TGTGACACCG TACTGGTTGA TCCGGACGAT
ATGTTAAAAG CGGCGCTACG ACTGAAAGAC GCGCTGGTAT TACGCTGTAG CGGCATGTCT
TTGCAACATG GGCACGACGA GAAAGGCGAA TGGTTGAAAA TCACCTATTA CGATGAAGAC
GGCGCGGATG TGAGTGAGCG TTTCCGTCTG CAAACACCTG CCCAGCGTAC CGCCTTCGAG
CAGCTTTTTA TCCGCCCGCA TACGCGCACA CCGGGCATCC CGCTGCGCTG GATCACCGCC
GCCGATATCC TCGCCCAGCA AGCCTTATTG CGACACCCGG ATTTTGTCGT CGCCCGCATG
AAAGGCCAGT ACTGGCAGGT GCGTGAAAAA GTGTTCGATT ACGAAGGTCG TTTTCGTCTG
GCGCACGAAT TACGCGGTTA A
 
Protein sequence
MIFTLRPYQQ EAVDATLNHF RRHKTPAVIV LPTGAGKSLV IAELARLARG RVLVLAHVKE 
LVAQNHAKYQ ALGLEADIFA AGLKRKESHG KVVFGSVQSV ARNLDAFQGE FSLLIVDECH
RIGDDEESQY QQILTHLTKV NPHLRLLGLT ATPFRLGKGW IYQFHYHGMV RGDEKALFRD
CIYELPLRYM IKHGYLTPPE RLDMPVVQYD FSRLQAQSNG LFSEADLNRE LKKQQRITPH
IISQIMEFAA TRKGVMIFAA TVEHAKEIVG LLPAEDAALI TGDTPGAERD VLIENFKAQR
FRYLVNVAVL TTGFDAPHVD LIAILRPTES VSLYQQIVGR GLRLAPGKTD CLILDYAGNP
HDLYAPEVGT PKGKSDNVPV QVFCPACGFA NTFWGKTTAD GTLIEHFGRR CQGWFEDDDG
HREQCDFRFR FKNCPQCNAE NDIAARRCRE CDTVLVDPDD MLKAALRLKD ALVLRCSGMS
LQHGHDEKGE WLKITYYDED GADVSERFRL QTPAQRTAFE QLFIRPHTRT PGIPLRWITA
ADILAQQALL RHPDFVVARM KGQYWQVREK VFDYEGRFRL AHELRG