Gene Dshi_4035 details


Gene Information

Locus tag: Dshi_4035
Symbol:
ID: 5714564
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009957
Strand:
Start bp: 99093
End bp: 100121
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 60%
IMG OID: 641276947
Product: transposase IS116/IS110/IS902 family protein
Protein accession: YP_001542243
Protein GI: 159046573
COG category: [L] Replication, recombination and repair
COG ID: [COG3547] Transposase and inactivated derivatives
TIGRFAM ID:
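
The length fields in this record are consistent with the coordinates: if Start bp and End bp are read as 1-based inclusive positions, the gene length is End - Start + 1, and the protein length is the codon count minus the terminal stop codon. A minimal Python sketch of that check follows. The inclusive-coordinate convention is an assumption (the record does not state it), and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length(99093, 100121)
print(length, protein_length(length))  # 1029 342
```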


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.327975
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTGT TTGTCGGATT GGATGTGTCG CTTGCGAAGA CTTCGGTCTG CGTGATCAGC 
GAGTACGGCA AGATTATCAA AGAGGCAGAG ACTGAAAGCG AACCCGAGGT TCTGGCGCGC
TGGCTGCATG ATCTGGACGG CAGCATCGCG GCGATTGGCC TGGAGGCTGG GCCTCTGTCG
CAATGGCTGC ACCGAGGGCT GACCGAAGCT GGCCTTGATA CGGTGCTCAT GGAAACGCGC
CAAGTGAAAG GAGCGCTGAA GGCGATGCCG ATCAAGACGG ATCGGCGCGA TGCAGAAGGG
ATTGCACGCC TTCTTCATCT CGGCTGGTTC CGCCCGGTCC ACTGTAAATC CGTGTCTGCT
CAGGAAACCC GGGCGGTTCT TGGCGCTCGA AAGGCTATCC AGCAGAACAT GATCGCTCTG
GAAATGTCGT TGCGCGGACT CCTGCGGAAC TTTGGCCTCA AGGTCGGCGC GATCTCCCGT
GGCAGGTTTG AGACACGCAT TCGGGAGTTG GCAGATGGCA ACCCGATGCT GGAAACCGCG
ACAGACCCGA TGCTGCGGGC CCGGGCGACC CTACGGCAGG AACTGGCCGG GCTCGAAGAA
CGCGTGCGCC AGTTGGCCTG GGATGATCAG GTTTGCCAAC GGCTTATGTC GATGCCTGGA
ATCGGTGCGG TCGTAGCACT TACATTCCGT GCTGCGGTCG ATGATCCTGC CCGCTTTCGG
TCTTCAAAGA GAATTGGCCC CTGGGTTGGC CTGACGCCCT CACGCAACCA GTCCGGTGAA
CGAGACGTGT CAGGCGGCAT CACCAAGGCT GGTGACGTCA ATCTGAGGCG AACATTGTGC
CAGGCAGCAA CCGTCATGAT GAATCGCGGC CGATCGACAT GGCTGAGAAC ATGGGGAGCC
CAGCTCGCGC AGCGGCGTGG TCGCAAAATC GCGATGGTCG CCCTCGCACG CCGCATCGCT
GTCATCCTCC ATCGGATTTG GGTCGATGGC ACAACCTTCC AGCCAGATGC CGCGCCGAAC
CTTGCCTGA
 
Protein sequence
MKLFVGLDVS LAKTSVCVIS EYGKIIKEAE TESEPEVLAR WLHDLDGSIA AIGLEAGPLS 
QWLHRGLTEA GLDTVLMETR QVKGALKAMP IKTDRRDAEG IARLLHLGWF RPVHCKSVSA
QETRAVLGAR KAIQQNMIAL EMSLRGLLRN FGLKVGAISR GRFETRIREL ADGNPMLETA
TDPMLRARAT LRQELAGLEE RVRQLAWDDQ VCQRLMSMPG IGAVVALTFR AAVDDPARFR
SSKRIGPWVG LTPSRNQSGE RDVSGGITKA GDVNLRRTLC QAATVMMNRG RSTWLRTWGA
QLAQRRGRKI AMVALARRIA VILHRIWVDG TTFQPDAAPN LA
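
The protein sequence can be reproduced from the gene sequence by translating it codon by codon, and the GC content by counting G and C bases. The sketch below is one way to do this in plain Python, not the database's own pipeline. It builds the codon map from the NCBI amino-acid string, which is shared by translation tables 1 and 11 (table 11 differs only in its permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
from itertools import product

# NCBI ordering: codons enumerated over T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three display blocks of the gene sequence above.
first_blocks = "ATGAAATTGT TTGTCGGATT GGATGTGTCG"
print(translate(first_blocks))  # MKLFVGLDVS
```

Passing the full gene sequence to `translate` and `gc_percent` allows the result to be compared with the Protein Length and GC content fields listed under Gene Information.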