Gene Ent638_4224 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4224 
Symbol 
ID5110385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009425 
Strand
Start bp34205 
End bp37189 
Gene Length2985 bp 
Protein Length994 aa 
Translation table11 
GC content51% 
IMG OID640480841 
Producttransposase Tn3 family protein 
Protein accessionYP_001165503 
Protein GI146284550 
COG category[L] Replication, recombination and repair 
COG ID[COG4644] Transposase and inactivated derivatives, TnpA family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.398551 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGGA GACAAATACT CAGCAGTGAA GAAAAAGAGC GCTTGCTGGT TGTACCGGAT 
GATGACGTTC TTCTGACGCG CATGTGTTTT TTGAGTGAAC CTGACCTCGC TCTTATTAAT
AAACACCGCA GACCTGCCAA TCGCCTGGGC TTTGCTGTTT TACTCTGTTA TTTGCGAGGG
CCGGGGTTTC CTCCCGACAA GAATATGTCC CCTCACGACG GGGTTGTTTC CCGGCTTGCG
GCTCATTTGA AACTTCAGCC TGATTTGTGG GCCGAGTATG CATCAAGAGA GGTCACCCGC
TGGGAGCATC TGGCCGAACT ATACCGCTAT CTGGAATTAT CCCCTTTCAA CCGGGCGCTG
CAAAAAACCT GCATTCGCCA CCTTTATCCC CACGCCATGC GGACAGACAG AGGTTTTTTG
CTTGCGGAAG AAATGCTCTC CTGGCTTCAC AATAATAATG TCATTTTCCC CTCAGTTGAT
GTCATTGAGC GGACCCTGGC CGAAGCAGCA ACGCTTGCAG ACAGGGCGGT TTTTTCTGCG
CTTATCACGC AGCTTGAACC AGGGCACAAA GCGGCACTGG ACCGTCTGCT GGTATCTGAG
GGTGAGCAAC CCTCACGGCT GGCCTGGCTG CTCCAGCCTC CGGGAAAAAT TAATGGTAAA
AATGTCCTGC AACATATCGA CCGGTTAAAT GCGATTGAGG CGCTGGCACT GCCTGACGGT
ATTGCACTTT CCGTTCACCA GAACCGGCTC CTGAAATTGG CACGCGAAGG CAGGAAAATG
AGTAGCCGGG ACCTGGCAAA ATTCACGGAT GTTCGCCGTT ATGCCTCGCT GGTTTGTGTC
ATATCGGAAG CCAGGGCCAC CCTGACTGAC GAAGTTATTG ATCTGCACGA GCGTATTCTG
GGCAGTCTGT TCAGCAGGGC AAAACGCACG CAAGCCGAAC GGCTTCAACA AACGGGAAAG
CTGATTCAGA GCAAGCTGAA GCAATACGTT ACCATCGGGC AGGCATTGCT TAACGCCAGA
GAGTCTGGTG AAGATCCCTG GGCCGCAATA GAAGATGTCC TTCCCTGGCA GGAATTTATC
AACAGCGTGG AAGAGACGCG CTTCCTGTCA CGTAAGGACA ACTTTGACCC GTTGCATCTG
ATCACAGAAA AATACAGTAC GCTGCGTAAA TACGCCCCCC GGATGCTGTC CGCGTTGCAG
TTCAGGGCTG CACCCGCTGC ATTGCAACTC AGTGACGCGC TGGACACCGT CAGAGAGATG
TATCGAAAAC AACTCCGAAA AGTACCGCCT TCCGCGCCAA CCGGGTTCAT CCCGGAAAGC
TGGAGAAAGG CGGTGATCAC TCCCACCGGC ATTGACCGAA AATATTACGA ATTTTGCGTG
CTGAATGAGC TGAAGGGAGC CTTACGTTCT GGTGACATCT GGGTAAAGGG ATCACGCCGC
TACCGGAATT TCGATGATTA TCTGATCCCG TCTGACGACT TTGAAAAATC ACTCCGGGAT
AATCAGCTAT CCCTTGCCAT TCCGACTGAT TGCCATGAGT ACATCAAGAA CCGTATGACA
CTTCTGACAT CGCGCCTGGA GGAAGTTAAT GCGATGGCGC TGGCCGGTGA TTTACCGGAT
GTTGATATAT CGGATAAAGG CGTGAAAATA ACCCCGCTGG ATAATAGTGT CCCTTCAGCA
GTATCTCCTT TTGCCGATTT GGTTTATGGC ATGCTGCCTC ACCCTAAAAT TACTGAAATT
CTGGATGAAG TGGACGGCTG GACCGGTTTT ACCCGCTATT TCACGCATCT TAAAAATAAA
CACGTCAGAC CAAAAGACAG AAAGTTGTTA CTGACCACCA TCCTGGCCGA TGGCATCAAT
CTGGGGCTGA CAAAAATGGC CGAATCCTGC CCGGGAACGA CAAAATCGTC ACTGGAGGGC
ATTCAGGCCT GGTACATCAG GGATGAAACC TATTCAGCCG CGCTGGCTGA GCTGGTTAAT
GCTCAGAAAC AGCGGCATCT GGCTGCATTT TGGGGCGATG GTACAACGTC TTCGTCTGAC
GGGCAGAACT TTCGGGTGGG TAGTCACGGA CGTTATGCGG GTCAGGTCAA TCTGAAATAC
GGGCAGGAAC CAGGCGTGCA GATTTACACG CATATCTCGG ATCAATACAG TCCATTTTAC
ACCAAAGTGA TCAGTCGCGT GCGCGACTCC ACCCACGTAC TTGACGGTCT GCTATACCAC
GAAAGTGATC TGGAAATTAC AGAGCATTAC ACTGATACCG CTGGTTTTAC GGAGCATGTT
TTCGCGCTGA TGCATCTGCT CGGATTTGCT TTTTCCCCAA GGATCCGGGA TCTTCACGAC
AAGCGGATTT TTATTCAGGG AAAGGCTGAG CGTTATCCGG GACTTCAATC TGTTATATCC
ACAACACCAC TGAATCTTAA AGACATTGAG ACGCACTGGA ATGAGGTACT TCGCCTCGCA
AGCTCGATAA AACAGGGGAC TGTTACCGCA TCGCTCATGA TGAAAAAACT GGCCAGTTAT
CCTAAACAAA ATGGCCTTGC AAAGGCATTG AGGGTAATCG GACGCATCGA ACGGGCACTG
TTTATGCTGG ACTGGTTTCG CGATCCATCA CTGCGCCGAC GCGTACAGGC AGGGCTGAAT
AAAGGTGAGG CCCGCAATGC GCTTGCACGA GCGGTCTTTA TGCACCGGTT GGGTGAGATC
AGAGATCGGG GGCTGGAGAA TCAGAGCTAC CGGGCCAGTG GTCTGACGTT GCTGACTGCG
GCGATCTCCC TGTGGGATAC GGTATATATA GAAAGAGCTA TAGATTCCCT GAGACGAAAA
GGGATCCCGA TTAATGAGCA ACTGATTTCT CATTTGTCCC CGCTGGGATG GGAGCACATC
AATCTGAGTG GGGATTACGT CTGGCGGACA AACCTTAAGC TGGGACAGGG TAAATACCGT
TCATTACGCT CAGTGGATAG TGGTCTGTAC AAAAAACAAG CTTAG
 
Protein sequence
MPRRQILSSE EKERLLVVPD DDVLLTRMCF LSEPDLALIN KHRRPANRLG FAVLLCYLRG 
PGFPPDKNMS PHDGVVSRLA AHLKLQPDLW AEYASREVTR WEHLAELYRY LELSPFNRAL
QKTCIRHLYP HAMRTDRGFL LAEEMLSWLH NNNVIFPSVD VIERTLAEAA TLADRAVFSA
LITQLEPGHK AALDRLLVSE GEQPSRLAWL LQPPGKINGK NVLQHIDRLN AIEALALPDG
IALSVHQNRL LKLAREGRKM SSRDLAKFTD VRRYASLVCV ISEARATLTD EVIDLHERIL
GSLFSRAKRT QAERLQQTGK LIQSKLKQYV TIGQALLNAR ESGEDPWAAI EDVLPWQEFI
NSVEETRFLS RKDNFDPLHL ITEKYSTLRK YAPRMLSALQ FRAAPAALQL SDALDTVREM
YRKQLRKVPP SAPTGFIPES WRKAVITPTG IDRKYYEFCV LNELKGALRS GDIWVKGSRR
YRNFDDYLIP SDDFEKSLRD NQLSLAIPTD CHEYIKNRMT LLTSRLEEVN AMALAGDLPD
VDISDKGVKI TPLDNSVPSA VSPFADLVYG MLPHPKITEI LDEVDGWTGF TRYFTHLKNK
HVRPKDRKLL LTTILADGIN LGLTKMAESC PGTTKSSLEG IQAWYIRDET YSAALAELVN
AQKQRHLAAF WGDGTTSSSD GQNFRVGSHG RYAGQVNLKY GQEPGVQIYT HISDQYSPFY
TKVISRVRDS THVLDGLLYH ESDLEITEHY TDTAGFTEHV FALMHLLGFA FSPRIRDLHD
KRIFIQGKAE RYPGLQSVIS TTPLNLKDIE THWNEVLRLA SSIKQGTVTA SLMMKKLASY
PKQNGLAKAL RVIGRIERAL FMLDWFRDPS LRRRVQAGLN KGEARNALAR AVFMHRLGEI
RDRGLENQSY RASGLTLLTA AISLWDTVYI ERAIDSLRRK GIPINEQLIS HLSPLGWEHI
NLSGDYVWRT NLKLGQGKYR SLRSVDSGLY KKQA