Gene Information |
Locus tag | EcDH1_3723 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4010374 |
End bp | 4011702 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | transposase IS4 family protein |
Protein accession | ACX41329 |
Protein GI | 260450907 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATTG GACAGGCTCT TGATCTGGTA TCCCGTTACG ATTCTCTGCG TAACCCACTG ACTTCTCTGG GGGATTACCT CGACCCCGAA CTCATCTCTC GTTGCCTTGC CGAATCAGGT ACTGTAACGC TACGCAAGCG CCGTCTTCCC CTCGAAATGA TGGTCTGGTG TATTGTTGGC ATGGCGCTTG AGCGTAAAGA ACCTCTTCAC CAGATTGTGA ATCGCCTGGA CATCATGCTG CCGGGCAATC GCCCCTTCGT TGCCCCCAGT GCCGTTATTC AGGCCCGCCA GCGCCTGGGA AGTGAGGCTG TCCGCCGCGT GTTCACGAAA ACAGCGCAGC TCTGGCATAA CGCCACGCCG CATCCGCACT GGTGCGGCCT GACCCTGCTG GCCATCGATG GTGTGTTCTG GCGCACACCG GATACACCAG AGAACGATGC AGCCTTCCCC CGCCAGACAC ATGCCGGGAA CCCGGCGCTC TACCCGCAGG TCAAAATGGT CTGCCAGATG GAACTGACCA GCCATCTGCT GACGGCTGCA GCCTTCGGCA CGATGAAGAA CAGCGAAAAT GAGCTTGCTG AGCAACTTAT AGAACAAACC GGCGATAACA CTCTGACGTT AATGGATAAA GGTTATTACT CACTGGGACT GTTAAATGCC TGGAGCCTGG CGGGAGAACA CCGCCACTGG ATGATACCTC TCAGAAAGGG AGCGCAATAT GAAGAGATCA GAAAACTGGG TAAAGGCGAT CATCTGGTGA AGCTGAAAAC CAGCCCGCAG GCACGAAAAA AGTGGCCGGG ACTGGGAAAT GAAGTGACTG CCCGCCTGCT GACCGTGACG CGCAAAGGAA AAGTCTGCCA TCTGCTGACG TCGATGACGG ACGCCATGCG CTTCCCCGGA GGAGAAATGG GGGATCTGTA CAGTCATCGC TGGGAAATCG AACTGGGATA CAGGGAGATA AAACAGACGA TGCAACGGAG CAGGCTGACG CTGAGAAGTA AAAAGCCGGA GCTTGTGGAG CAAGAGCTGT GGGGTGTCTT ACTGGCTTAT AATCTGGTGA GATATCAGAT GATTAAAATG GCGGAACATC TGAAAGGTTA CTGGCCGAAT CAACTGAGTT TCTCAGAATC ATGCGGAATG GTGATGAGAA TGCTGATGAC ATTGCAGGGC GCTTCACCGG GACGTATACC GGAGCTGATG CGCGATCTTG CAAGTATGGG ACAACTTGTG AAATTACCGA CAAGAAGGGA AAGGGCCTTC CCGAGAGTGG TAAAGGAGAG GCCCTGGAAA TACCCCACAG CCCCGAAAAA GAGCCAGTCA GTTGCTTAA
|
Protein sequence | MHIGQALDLV SRYDSLRNPL TSLGDYLDPE LISRCLAESG TVTLRKRRLP LEMMVWCIVG MALERKEPLH QIVNRLDIML PGNRPFVAPS AVIQARQRLG SEAVRRVFTK TAQLWHNATP HPHWCGLTLL AIDGVFWRTP DTPENDAAFP RQTHAGNPAL YPQVKMVCQM ELTSHLLTAA AFGTMKNSEN ELAEQLIEQT GDNTLTLMDK GYYSLGLLNA WSLAGEHRHW MIPLRKGAQY EEIRKLGKGD HLVKLKTSPQ ARKKWPGLGN EVTARLLTVT RKGKVCHLLT SMTDAMRFPG GEMGDLYSHR WEIELGYREI KQTMQRSRLT LRSKKPELVE QELWGVLLAY NLVRYQMIKM AEHLKGYWPN QLSFSESCGM VMRMLMTLQG ASPGRIPELM RDLASMGQLV KLPTRRERAF PRVVKERPWK YPTAPKKSQS VA
|
| |
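The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in Protein Length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Encoded amino acids, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_content(sequence: str) -> float:
    """Percent G+C; whitespace between 10-base groups is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record for EcDH1_3723.
length = gene_length(4010374, 4011702)
print(length)                  # 1329, matching "Gene Length"
print(protein_length(length))  # 442, matching "Protein Length"
```

Passing the Gene sequence field to `gc_content` would give a value to compare with the reported GC content, which the record rounds to a whole percent.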