Gene Dshi_3759 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3759 
Symbol 
ID5714288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009955 
Strand
Start bp162514 
End bp163542 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content60% 
IMG OID641276674 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001541970 
Protein GI159046298 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.076485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.271017 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGT TTGTCGGATT GGATGTGTCG CTTGCGAAGA CTTCGGTCTG CGTGATCAGC 
GAGTACGGCA AGATTATCAA AGAGGCAGAG ACTGAAAGCG AACCCGAGGT TCTGGCGCGC
TGGCTGCATG ATCTGGACGG CAGCATCGCG GCGATTGGCC TGGAGGCTGG GCCTCTGTCG
CAATGGCTGC ACCGAGGGCT GACCGAAGCT GGCCTTGATA CGGTGCTCAT GGAAACGCGC
CAAGTGAAAG GAGCGCTGAA GGCGATGCCG ATCAAGACGG ATCGGCGCGA TGCAGAAGGG
ATTGCACGCC TTCTTCATCT CGGCTGGTTC CGCCCGGTCC ACTGTAAATC CGTGTCTGCT
CAGGAAACCC GGGCGGTTCT TGGCGCTCGA AAGGCTATCC AGCAGAACAT GATCGCTCTG
GAAATGTCGT TGCGCGGACT CCTGCGGAAC TTTGGCCTCA AGGTCGGCGC GATCTCCCGT
GGCAGGTTTG AGACACGCAT TCGGGAGTTG GCAGATGGCA ACCCGATGCT GGAAACCGCG
ACAGACCCGA TGCTGCGGGC CCGGGCGACC CTACGGCAGG AACTGGCCGG GCTCGAAGAA
CGCGTGCGCC AGTTGGCCTG GGATGATCAG GTTTGCCAAC GGCTTATGTC GATGCCTGGA
ATCGGTGCGG TCGTAGCACT TACATTCCGT GCTGCGGTCG ATGATCCTGC CCGCTTTCGG
TCTTCAAAGA GAATTGGCCC CTGGGTTGGC CTGACGCCCT CACGCAACCA GTCCGGTGAA
CGAGACGTGT CAGGCGGCAT CACCAAGGCT GGTGACGTCA ATCTGAGGCG AACATTGTGC
CAGGCAGCAA CCGTCATGAT GAATCGCGGC CGATCGACAT GGCTGAGAAC ATGGGGAGCC
CAGCTCGCGC AGCGGCGTGG TCGCAAAATC GCGATGGTCG CCCTCGCACG CCGCATCGCT
GTCATCCTCC ATCGGATTTG GGTCGATGGC ACAACCTTCC AGCCAGATGC CGCGCCGAAC
CTTGCCTGA
 
Protein sequence
MKLFVGLDVS LAKTSVCVIS EYGKIIKEAE TESEPEVLAR WLHDLDGSIA AIGLEAGPLS 
QWLHRGLTEA GLDTVLMETR QVKGALKAMP IKTDRRDAEG IARLLHLGWF RPVHCKSVSA
QETRAVLGAR KAIQQNMIAL EMSLRGLLRN FGLKVGAISR GRFETRIREL ADGNPMLETA
TDPMLRARAT LRQELAGLEE RVRQLAWDDQ VCQRLMSMPG IGAVVALTFR AAVDDPARFR
SSKRIGPWVG LTPSRNQSGE RDVSGGITKA GDVNLRRTLC QAATVMMNRG RSTWLRTWGA
QLAQRRGRKI AMVALARRIA VILHRIWVDG TTFQPDAAPN LA