Gene Information |
Locus tag | Ent638_4224 |
Symbol | |
ID | 5110385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009425 |
Strand | + |
Start bp | 34205 |
End bp | 37189 |
Gene Length | 2985 bp |
Protein Length | 994 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640480841 |
Product | transposase Tn3 family protein |
Protein accession | YP_001165503 |
Protein GI | 146284550 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4644] Transposase and inactivated derivatives, TnpA family |
TIGRFAM ID | |
|
|
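The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that encodes no residue. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 34205
end_bp = 37189

# Assumption: coordinates are 1-based and inclusive,
# so the gene spans end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not
# counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2985 994
```

Both results match the Gene Length (2985 bp) and Protein Length (994 aa) fields.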
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.398551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGGA GACAAATACT CAGCAGTGAA GAAAAAGAGC GCTTGCTGGT TGTACCGGAT GATGACGTTC TTCTGACGCG CATGTGTTTT TTGAGTGAAC CTGACCTCGC TCTTATTAAT AAACACCGCA GACCTGCCAA TCGCCTGGGC TTTGCTGTTT TACTCTGTTA TTTGCGAGGG CCGGGGTTTC CTCCCGACAA GAATATGTCC CCTCACGACG GGGTTGTTTC CCGGCTTGCG GCTCATTTGA AACTTCAGCC TGATTTGTGG GCCGAGTATG CATCAAGAGA GGTCACCCGC TGGGAGCATC TGGCCGAACT ATACCGCTAT CTGGAATTAT CCCCTTTCAA CCGGGCGCTG CAAAAAACCT GCATTCGCCA CCTTTATCCC CACGCCATGC GGACAGACAG AGGTTTTTTG CTTGCGGAAG AAATGCTCTC CTGGCTTCAC AATAATAATG TCATTTTCCC CTCAGTTGAT GTCATTGAGC GGACCCTGGC CGAAGCAGCA ACGCTTGCAG ACAGGGCGGT TTTTTCTGCG CTTATCACGC AGCTTGAACC AGGGCACAAA GCGGCACTGG ACCGTCTGCT GGTATCTGAG GGTGAGCAAC CCTCACGGCT GGCCTGGCTG CTCCAGCCTC CGGGAAAAAT TAATGGTAAA AATGTCCTGC AACATATCGA CCGGTTAAAT GCGATTGAGG CGCTGGCACT GCCTGACGGT ATTGCACTTT CCGTTCACCA GAACCGGCTC CTGAAATTGG CACGCGAAGG CAGGAAAATG AGTAGCCGGG ACCTGGCAAA ATTCACGGAT GTTCGCCGTT ATGCCTCGCT GGTTTGTGTC ATATCGGAAG CCAGGGCCAC CCTGACTGAC GAAGTTATTG ATCTGCACGA GCGTATTCTG GGCAGTCTGT TCAGCAGGGC AAAACGCACG CAAGCCGAAC GGCTTCAACA AACGGGAAAG CTGATTCAGA GCAAGCTGAA GCAATACGTT ACCATCGGGC AGGCATTGCT TAACGCCAGA GAGTCTGGTG AAGATCCCTG GGCCGCAATA GAAGATGTCC TTCCCTGGCA GGAATTTATC AACAGCGTGG AAGAGACGCG CTTCCTGTCA CGTAAGGACA ACTTTGACCC GTTGCATCTG ATCACAGAAA AATACAGTAC GCTGCGTAAA TACGCCCCCC GGATGCTGTC CGCGTTGCAG TTCAGGGCTG CACCCGCTGC ATTGCAACTC AGTGACGCGC TGGACACCGT CAGAGAGATG TATCGAAAAC AACTCCGAAA AGTACCGCCT TCCGCGCCAA CCGGGTTCAT CCCGGAAAGC TGGAGAAAGG CGGTGATCAC TCCCACCGGC ATTGACCGAA AATATTACGA ATTTTGCGTG CTGAATGAGC TGAAGGGAGC CTTACGTTCT GGTGACATCT GGGTAAAGGG ATCACGCCGC TACCGGAATT TCGATGATTA TCTGATCCCG TCTGACGACT TTGAAAAATC ACTCCGGGAT AATCAGCTAT CCCTTGCCAT TCCGACTGAT TGCCATGAGT ACATCAAGAA CCGTATGACA CTTCTGACAT CGCGCCTGGA GGAAGTTAAT GCGATGGCGC TGGCCGGTGA TTTACCGGAT GTTGATATAT CGGATAAAGG CGTGAAAATA ACCCCGCTGG ATAATAGTGT CCCTTCAGCA GTATCTCCTT TTGCCGATTT GGTTTATGGC ATGCTGCCTC ACCCTAAAAT TACTGAAATT CTGGATGAAG TGGACGGCTG GACCGGTTTT ACCCGCTATT TCACGCATCT TAAAAATAAA 
CACGTCAGAC CAAAAGACAG AAAGTTGTTA CTGACCACCA TCCTGGCCGA TGGCATCAAT CTGGGGCTGA CAAAAATGGC CGAATCCTGC CCGGGAACGA CAAAATCGTC ACTGGAGGGC ATTCAGGCCT GGTACATCAG GGATGAAACC TATTCAGCCG CGCTGGCTGA GCTGGTTAAT GCTCAGAAAC AGCGGCATCT GGCTGCATTT TGGGGCGATG GTACAACGTC TTCGTCTGAC GGGCAGAACT TTCGGGTGGG TAGTCACGGA CGTTATGCGG GTCAGGTCAA TCTGAAATAC GGGCAGGAAC CAGGCGTGCA GATTTACACG CATATCTCGG ATCAATACAG TCCATTTTAC ACCAAAGTGA TCAGTCGCGT GCGCGACTCC ACCCACGTAC TTGACGGTCT GCTATACCAC GAAAGTGATC TGGAAATTAC AGAGCATTAC ACTGATACCG CTGGTTTTAC GGAGCATGTT TTCGCGCTGA TGCATCTGCT CGGATTTGCT TTTTCCCCAA GGATCCGGGA TCTTCACGAC AAGCGGATTT TTATTCAGGG AAAGGCTGAG CGTTATCCGG GACTTCAATC TGTTATATCC ACAACACCAC TGAATCTTAA AGACATTGAG ACGCACTGGA ATGAGGTACT TCGCCTCGCA AGCTCGATAA AACAGGGGAC TGTTACCGCA TCGCTCATGA TGAAAAAACT GGCCAGTTAT CCTAAACAAA ATGGCCTTGC AAAGGCATTG AGGGTAATCG GACGCATCGA ACGGGCACTG TTTATGCTGG ACTGGTTTCG CGATCCATCA CTGCGCCGAC GCGTACAGGC AGGGCTGAAT AAAGGTGAGG CCCGCAATGC GCTTGCACGA GCGGTCTTTA TGCACCGGTT GGGTGAGATC AGAGATCGGG GGCTGGAGAA TCAGAGCTAC CGGGCCAGTG GTCTGACGTT GCTGACTGCG GCGATCTCCC TGTGGGATAC GGTATATATA GAAAGAGCTA TAGATTCCCT GAGACGAAAA GGGATCCCGA TTAATGAGCA ACTGATTTCT CATTTGTCCC CGCTGGGATG GGAGCACATC AATCTGAGTG GGGATTACGT CTGGCGGACA AACCTTAAGC TGGGACAGGG TAAATACCGT TCATTACGCT CAGTGGATAG TGGTCTGTAC AAAAAACAAG CTTAG
|
Protein sequence | MPRRQILSSE EKERLLVVPD DDVLLTRMCF LSEPDLALIN KHRRPANRLG FAVLLCYLRG PGFPPDKNMS PHDGVVSRLA AHLKLQPDLW AEYASREVTR WEHLAELYRY LELSPFNRAL QKTCIRHLYP HAMRTDRGFL LAEEMLSWLH NNNVIFPSVD VIERTLAEAA TLADRAVFSA LITQLEPGHK AALDRLLVSE GEQPSRLAWL LQPPGKINGK NVLQHIDRLN AIEALALPDG IALSVHQNRL LKLAREGRKM SSRDLAKFTD VRRYASLVCV ISEARATLTD EVIDLHERIL GSLFSRAKRT QAERLQQTGK LIQSKLKQYV TIGQALLNAR ESGEDPWAAI EDVLPWQEFI NSVEETRFLS RKDNFDPLHL ITEKYSTLRK YAPRMLSALQ FRAAPAALQL SDALDTVREM YRKQLRKVPP SAPTGFIPES WRKAVITPTG IDRKYYEFCV LNELKGALRS GDIWVKGSRR YRNFDDYLIP SDDFEKSLRD NQLSLAIPTD CHEYIKNRMT LLTSRLEEVN AMALAGDLPD VDISDKGVKI TPLDNSVPSA VSPFADLVYG MLPHPKITEI LDEVDGWTGF TRYFTHLKNK HVRPKDRKLL LTTILADGIN LGLTKMAESC PGTTKSSLEG IQAWYIRDET YSAALAELVN AQKQRHLAAF WGDGTTSSSD GQNFRVGSHG RYAGQVNLKY GQEPGVQIYT HISDQYSPFY TKVISRVRDS THVLDGLLYH ESDLEITEHY TDTAGFTEHV FALMHLLGFA FSPRIRDLHD KRIFIQGKAE RYPGLQSVIS TTPLNLKDIE THWNEVLRLA SSIKQGTVTA SLMMKKLASY PKQNGLAKAL RVIGRIERAL FMLDWFRDPS LRRRVQAGLN KGEARNALAR AVFMHRLGEI RDRGLENQSY RASGLTLLTA AISLWDTVYI ERAIDSLRRK GIPINEQLIS HLSPLGWEHI NLSGDYVWRT NLKLGQGKYR SLRSVDSGLY KKQA
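The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two differ in permitted start codons, which this sketch ignores). The helper names `translate` and `gc_fraction` are hypothetical and written only for this example.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = "".join(dna.split()).upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGCCACGGA GACAAATACT CAGCAGTGAA"))  # MPRRQILSSE
print(translate("AAAAAACAAG CTTAG"))                  # KKQA
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value that the GC content field reports in rounded form.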