Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3878 |
Symbol | |
ID | 6067834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4234721 |
End bp | 4235986 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641603294 |
Product | integrase family protein |
Protein accession | YP_001726809 |
Protein GI | 170021855 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.642125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGA CTGACGCAAA AATCCGGGCT GCAAAGCCCA CTGACAAGGC TTATAAACTC ACTGACGGGG CTGGCATGTT CCTGCTGGTA CATCCCAATG GTTCCCGTTA CTGGCGTCTC CGTTATCGTA TTCTGGGTAA GGAGAAGACT CTGGCACTTG GTGTGTATCC GGAAGTTTCT CTCTCCGAAG CTCGTACAAA ACGGGATGAG GCCCGAAAAC TGATTTCGGA GGGGGTTGAC CCTTGCGAAC AAAAAAGAGC TAAAAAAGTA GTCCCTGATT TACAACTCTC TTTTGAACAT ATTGCACGAC GCTGGCATGC CAGTAATAAA CAATGGGCAC AATCACACAG CGATAAAGTA CTCAAAAGCC TCGAGACACA CGTTTTCCCC TTTATCGGCA ACCGGGATAT CACAACACTC CGTACCCCGG ACCTGCTTAT CCCTGTTCGT GCTGCAGAAG CAAAACAAAT TTATGAAATC GCCAGTCGTC TGCAGCAAAG AATATCTGCT GTAATGCGTT ATGCCGTACA GTCTGGCATC ATCAGATATA ATCCTGCTCT GGATATGGCT GGCGCATTGA CCACTGTAAA ACGCCAGCAT CGCCCCGCTC TTGATCTTTC TCGCCTGCCT GAACTTTTGT CGCGTATTAG CAGTTACAAG GGGCAACCTG TCACCCAGCT TGCCGTTACG CTGAATTTAC TGGTTTTTAT TCGTTCCAGT GAACTCAGAT ACGCCCGGTG GTCTGAAATT GATATTGACA ATGCCATGTG GACTATTCCA GCCGAACGCG AACCTCTGCC CGGCGTAAAA TTCTCACACC GGGGCTCCAA GATGCGAACA CCACATCTTG TGCCACTCAG CAAACAGGCT GTAGCCATAC TGACAGAACT TCAGACATGG GCAGGTGAAA ATGGTCTGAT ATTTACGGGA GCACATGACC CGCGTAAACC AATCAGTGAA AACACCGTAA ATAAGGCACT GAGGGGTATG GGATATGACA CAACCCAGGA TGTCTGTGGG CATGGGTTCA GGGCGATGGC GTGCAGTGCG TTAATAGAAT CAGGTTTGTG GTCCCGTGAT GCAGTTGAAC GTCAGATGAG CCATCAGGAA CGCAATGGTG TACGTGCTGC TTACATTCAT AAAGCAGAAC ATCTGGAAGA ACGCCGACTG ATGTTACAGT GGTGGGCCGA TTTTCTGGAT GCAAACAGAG AAAAATTTAT CAGTCCATTT GAATATGCAA AGATTAATAA TCCATTAAAA CCGTAA
|
Protein sequence | MALTDAKIRA AKPTDKAYKL TDGAGMFLLV HPNGSRYWRL RYRILGKEKT LALGVYPEVS LSEARTKRDE ARKLISEGVD PCEQKRAKKV VPDLQLSFEH IARRWHASNK QWAQSHSDKV LKSLETHVFP FIGNRDITTL RTPDLLIPVR AAEAKQIYEI ASRLQQRISA VMRYAVQSGI IRYNPALDMA GALTTVKRQH RPALDLSRLP ELLSRISSYK GQPVTQLAVT LNLLVFIRSS ELRYARWSEI DIDNAMWTIP AEREPLPGVK FSHRGSKMRT PHLVPLSKQA VAILTELQTW AGENGLIFTG AHDPRKPISE NTVNKALRGM GYDTTQDVCG HGFRAMACSA LIESGLWSRD AVERQMSHQE RNGVRAAYIH KAEHLEERRL MLQWWADFLD ANREKFISPF EYAKINNPLK P
|
| |