Gene Spro_3066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSpro_3066 
Symbol 
ID5604274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSerratia proteamaculans 568 
KingdomBacteria 
Replicon accessionNC_009832 
Strand
Start bp3375170 
End bp3376378 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content48% 
IMG OID640938607 
ProductIS605 family transposase OrfB 
Protein accessionYP_001479295 
Protein GI157371306 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGAC TTCAAGCCTT CAAATTCCAG TTAAGACCCA ATGGTCAGCA GGTCCGCGAT 
ATGCGGCGCT TCGCTGGGGC ATGTCGTTTT GTATTCAATA AATCACTCGC CTTGCAGAAC
GAAAATCATG AGACAGGTAA TAAATACCTT CCCTATGTGA AAATGGCGGC ATGGTTGGTT
GAGTGGAAAA AAATGCCTGA CACAGCATGG TTGAAAGAAG CACCTTCCCA GCCTTTGCAG
CAGGCGTTAA AAGACCTCGA ACGCGCCTAC AAAAATTTTT TCCAGAAACG GGCATCATTC
CCCCGGTTTA AAAAACGCGG TCAAAGTGAC ACATTGCGCT ACCCGCAGGG TGTGAAACTG
GATCAAGAAA ACAACCGCAT ATCGCTGCCC AAACTTGGCT GGATACACTA TCGCAATAGT
CGTCAAATTG TGGGCGAGGT TAAAAATGTC ACGGTTAGCC AGTCATGTGG CAAGTGGTAT
ATCAGTGTCC AGACTGAATA TGAAGCTGAT GAATCGGTTC ATACGTCAAC TTCTATGGTT
GGGCTTGATG CTGGCGTAGC GAAGCTCGCC ACGCTATCAG ATGGCACTAT TTTTGAGCCA
GTAAACAGCT TTAAAATCAA CCAAAATAAA CTCGCCAGGC TCCAACGTGA AATGAACCGT
AAGGTGAAAT TCAGCAACAA CTGGAAGAAA GCAAAACGTA AAGTACAAAA TCTGCATTCC
CGTATCGGTA ATATCCGCCG CGACTACCTC CATAAAGTCA GCACGACAAT CAGCAAAAAC
CACGCGATGA TCGTCATTGA GGACTTGAAG GTATCAAATA TGTCAAAGTC AGCAGTGGGT
ACTGAAAGCC AGCCAGGGGG CAATGTCCGG GCAAAATCAG GCTTAAACCG TTCGATACTG
GATCAGGGTT GGTACGAACT CCGTCGGCAG CTTGAGTACA AGCAGTTCTG GCGAGGTGGT
CAGGTACTGG CAATCAATCC AGCCTACACC AGTCAAAAAT GCGCCTGCTG TGGTCATACA
GCGAAAGAAA ACCGGCAATC ACAAAGTCAG TTCGAGTGTC TGGTATGTGG GTACACCGCG
AACGCCGATA TAAATGGCGC CCGCAATATT TTAGCGGCAG GGCATGTCGC GTTAGCCTGT
GGAGAGATGG CAGCTTTAGG CCGCTCTATG AAGCAGGAAC CCACCGAGGC GAGTCAGACT
TCGGTCTGA
 
Protein sequence
MKRLQAFKFQ LRPNGQQVRD MRRFAGACRF VFNKSLALQN ENHETGNKYL PYVKMAAWLV 
EWKKMPDTAW LKEAPSQPLQ QALKDLERAY KNFFQKRASF PRFKKRGQSD TLRYPQGVKL
DQENNRISLP KLGWIHYRNS RQIVGEVKNV TVSQSCGKWY ISVQTEYEAD ESVHTSTSMV
GLDAGVAKLA TLSDGTIFEP VNSFKINQNK LARLQREMNR KVKFSNNWKK AKRKVQNLHS
RIGNIRRDYL HKVSTTISKN HAMIVIEDLK VSNMSKSAVG TESQPGGNVR AKSGLNRSIL
DQGWYELRRQ LEYKQFWRGG QVLAINPAYT SQKCACCGHT AKENRQSQSQ FECLVCGYTA
NADINGARNI LAAGHVALAC GEMAALGRSM KQEPTEASQT SV