Gene pE33L466_0069 details


Gene Information

Locus tag: pE33L466_0069
Symbol: tnpA
ID: 3399589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_007103
Strand:
Start bp: 70225
End bp: 73176
Gene Length: 2952 bp
Protein Length: 983 aa
Translation table: 11
GC content: 35%
IMG OID: 637659908
Product: transposase
Protein accession: YP_245572
Protein GI: 67077952
COG category: [L] Replication, recombination and repair
COG ID: [COG4644] Transposase and inactivated derivatives, TnpA family
TIGRFAM ID:
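
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(70225, 73176)
print(length_bp)                  # 2952
print(protein_length(length_bp))  # 983
```

Both values agree with the Gene Length and Protein Length fields of this record.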


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.197495
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGGAA AAGAACTTCT CACGCCTGTA CAAAGGGAAG AACTATTACA TATTTCTGTT 
GAAACAGAAC ATGAGTTAGC ACTGCATTAC ACATTTTCTA CAGAGGACCT TGAAATCATT
AATCAACATC GTAGAGATCA CAATCGCTTA GGATTTGCTG TTCAGCTCTG TATCTTAAGA
TATCCTGGAT GTACTGTTAC GAATATGCCA ACAATTCCTG AAGGACTCTT AAAGTTTGTT
GCTAAACAAA TCAGTGTAGA CCACACGGTC TATGAAGCAT ATGCAAAACG AGAACCTACT
AGGCGAGAGC ACTTAGAAGA AATACGTAAA GAATATGGAT ATCGTAACTT TACTATTCGT
GATTATCGAA GGATTTCTAA ATTTCTACAA CCATATGCAT TAGAAAATGG AAATACAATG
TATCTGATTC AAACTGCATT ACAAGAACTA CGAAAAGAGA AAATTATTTT ACCTGCAATA
CCAACAATAG AACGAGCTGT ATGGGAAGTA CGTAAGCGCA CAGAAGAAAA GATTTTTAAA
GTACTAACCT CCTCTTTGAC TTCTTCACAA CAAGAAAAGT TAGATAGACT ACTCCATCCA
ATGCCTAATA CTTCAAAAAC TTATTTATCA TGGCTAAGAG AAATACCTGG CCAATTTTCT
CCTGATGCAT TTTTAAAGGT AATTGAACGC TTAGATTACA TAAGAAGATT GACATTAAAG
CTTGACACCA AAGGTTTACA TCCAAATCGA ATACGACAAT TATCTCGAAT AGGAGCAAGG
TATGAACCCT TCTCGTTTCG ACGATTTCAC AGTACCAAAA AATATGCGAT CATGATTGCA
TATTTAATAG ATTTAACGCA AGACTTGGTT GATCAAGCAT TTGAGATTCA TGATAAACAA
ATCATGAATC TTCAACTAAA AGGAAGAAAA CAGCAAGAAG AAATTCAAAA ACAGAATGGA
AAATCAGTTA ATGAAAAAAT CAATCATTAC GCGAATTTAG GAACAGCTTT AATTAAAGCT
AGACAAGAAA ACTTAGATCC ATTTTTATTA TTAGAAACGG TGATGCCTTG GGATACATTT
GTAGCCTCTG TAGAGGAAGC AAAAAAGTTA TCTCGTCCTA TGAATTATGA TTACCTTGAT
TTATTAGAAA GTCGTTATAA TTATTTACGT AAGTATACGC CAACACTACT CAGAACATTA
GAATTTCAAT CTACCAAGTA TGCAAATCCC GTTTTATTAG CTTTAGACAC TATCCACGAG
TTAAATGAAG CTGGAAAACG AAAAGTTCCA GAAGGCGCTC CATTAAGCTT TGTTTCCAAG
CGTTGGGAAA GATATGTATA TGATGAAGAC GGAAGTATTA ATCGACATTT TTACGAATTG
GCCGCATTTA CTGAACTTCG AAATTATGTA CGCTCAGGGG ATATTTCTAT TGTAGGTAGT
AGACAACATA AAGATTTTGA TGAATATCTT ATTCCACAGC AAGAATGGAC AAAGAGTAAA
AATATCGGTA CCAGGTTAGC GGTTCCGATT CAAGTAGAAG AATACATAAA AGAGCGCACT
GAAACCCTTT TGCATCGAAT TCAATCCTTT TCTAAAAATG TAAATTCTCT AGAAGGCGTA
GATCTAGAAA AAGGAATTTT GCGTATTCAT CGCTTAGAAA GAGATGTCCC AGAAGACGCC
AAGAAACTAA GCGCTAAGCT ATACAATATG TTACCACGTA TAAAACTAAC GGATTTATTA
CTAGAAGTGT CGAATTGGAC AAATTTTGAA CAGCAACTGA TCCACGCATC AACAAATAAA
GCGCCTAAAG GCGATGAAAT TATTATATCT CTTGCTGCAA TGATGGCGAT GGGAACTAAT
ATTGGTCTAA CTAAAATGGC AGATGCTACT CCTGGTATCT CCTATCATCA GTTAGCTCAC
GCTAGCCAGT GGAGAATGTA TGATGATGCT TTTCAACGGG CTCAATCCAT ACTCGTAAAC
TTTCAGCATA AAATTCCTTT ATCCTCTTAT TGGGGAGACG GAACTACTTC ATCATCGGAT
GGAATGAGAG TTCAAATTGG GGTCTCTTCT CTCAGTGCTA GTTTCAATCC ACACTATGGG
ACAGGAAAAG GAGCAACCAT TTATCGATTC GTAAGTGATC AATTTTCTTC TTTTTATACA
AAGGTGATTA ACACAAATGC TAGGGATGCT GTACATGTTA TCGATGGATT ATTACATCAT
GAATCTGATT TGGTTATTGA AGAGCATTAC ACAGATACAG CAGGCTATAC AGATCAAGTC
TTTGGATTGG CTCATTTATT AGGTTTTCGC TTTGCGCCTC GGTTACGTGA TTTAGCCACT
TCTAAATTAT ATACGATTGG ATCACCTAAA GAATTCTCAA ATATAGAATC CTTAATACGG
GGACAAATTA ATATGAAGTT AATTTGCGAT AATTATGATG ATGTTTTACG GCTAGCTCAT
TCTATCCGAG AAGGTAAAGT ATCAAGTGCA TTGATTATGG GGAAATTAGG GTCTTATACC
CGTCAAAATA AAGTGGCGAA AGCACTAAGA GAAATAGGGA GAATTGAGAA AACAATTTTC
ATCTTAGATT ATCTTTCAGA TAAAACAATG CGTAGACGTA TTCAACGAGG ATTGAATAAA
GGGGAGGCTA TGAATGCACT TGCTCGAGCT ATTTTCTTTG GAAAACATGG TGAATTACGA
GAAAGAGCGC TACAAGATCA ACTTCAAAGA AGTAGTGCAC TAAACCTCTT AATTAATGCA
ATTAGTGTGT GGAATACTGT TTATTTAAGT GAAGCAATAA ACGTTTTGAA GAGAAAAGAA
AAATTTGATG AAGAACTATT AAAACACATT TCTCCATTAG GATGGGAACA CATTAACTTT
CTAGGAGAAT ACCGATTTAG TAAAAAAGAA ATTGCACCAT TAGACTCTCT ACGACCATTA
CAGATAACTT AG
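
The gene sequence above is printed in blocks of 10 bases, 60 per line. The following is a minimal sketch of how such a listing could be joined into one string and its GC fraction computed, for comparison with the GC content field. The helper names are hypothetical, and only the first line of the listing is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked listing and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence listing, copied from the record.
first_line = "ATGAGAGGAA AAGAACTTCT CACGCCTGTA CAAAGGGAAG AACTATTACA TATTTCTGTT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full listing to `clean_sequence` should give a string of 2952 bases if the record is internally consistent.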
 
Protein sequence
MRGKELLTPV QREELLHISV ETEHELALHY TFSTEDLEII NQHRRDHNRL GFAVQLCILR 
YPGCTVTNMP TIPEGLLKFV AKQISVDHTV YEAYAKREPT RREHLEEIRK EYGYRNFTIR
DYRRISKFLQ PYALENGNTM YLIQTALQEL RKEKIILPAI PTIERAVWEV RKRTEEKIFK
VLTSSLTSSQ QEKLDRLLHP MPNTSKTYLS WLREIPGQFS PDAFLKVIER LDYIRRLTLK
LDTKGLHPNR IRQLSRIGAR YEPFSFRRFH STKKYAIMIA YLIDLTQDLV DQAFEIHDKQ
IMNLQLKGRK QQEEIQKQNG KSVNEKINHY ANLGTALIKA RQENLDPFLL LETVMPWDTF
VASVEEAKKL SRPMNYDYLD LLESRYNYLR KYTPTLLRTL EFQSTKYANP VLLALDTIHE
LNEAGKRKVP EGAPLSFVSK RWERYVYDED GSINRHFYEL AAFTELRNYV RSGDISIVGS
RQHKDFDEYL IPQQEWTKSK NIGTRLAVPI QVEEYIKERT ETLLHRIQSF SKNVNSLEGV
DLEKGILRIH RLERDVPEDA KKLSAKLYNM LPRIKLTDLL LEVSNWTNFE QQLIHASTNK
APKGDEIIIS LAAMMAMGTN IGLTKMADAT PGISYHQLAH ASQWRMYDDA FQRAQSILVN
FQHKIPLSSY WGDGTTSSSD GMRVQIGVSS LSASFNPHYG TGKGATIYRF VSDQFSSFYT
KVINTNARDA VHVIDGLLHH ESDLVIEEHY TDTAGYTDQV FGLAHLLGFR FAPRLRDLAT
SKLYTIGSPK EFSNIESLIR GQINMKLICD NYDDVLRLAH SIREGKVSSA LIMGKLGSYT
RQNKVAKALR EIGRIEKTIF ILDYLSDKTM RRRIQRGLNK GEAMNALARA IFFGKHGELR
ERALQDQLQR SSALNLLINA ISVWNTVYLS EAINVLKRKE KFDEELLKHI SPLGWEHINF
LGEYRFSKKE IAPLDSLRPL QIT
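
The protein sequence can be checked against the gene sequence by translating the CDS. The following is a minimal sketch, assuming the codon-to-residue assignments of NCBI translation table 11, which are the same as those of the standard code. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. All names are hypothetical.

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate the 10-base block spacing
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGAGGAA AAGAACTTCT CACGCCTGTA"))  # MRGKELLTPV
print(translate("CAGATAACTT AG"))                     # QIT
```

The two outputs match the first ten and last three residues of the protein sequence listed above.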