Gene Dshi_3878 details


Gene Information

Locus tag: Dshi_3878
Symbol:
ID: 5714407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009956
Strand:
Start bp: 100522
End bp: 101904
Gene length: 1383 bp
Protein length: 460 aa
Translation table: 11
GC content: 59%
IMG OID: 641276791
Product: transposase IS4 family protein
Protein accession: YP_001542087
Protein GI: 159046416
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
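
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 100522
end_bp = 101904

# Inclusive span length in base pairs.
gene_length = end_bp - start_bp + 1

# Split the span into codons; a nonzero remainder would indicate a partial codon.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1383 0 460
```

Under these assumptions the computed values agree with the listed gene length (1383 bp) and protein length (460 aa).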


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.726843
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGGGGC CAAGGCAGGA AGCACAGGCG GCACTGTTTT ACGAGTTTTC GCTGGAGGAG 
CATGTCCCGC AGGACCACCT TTTGAGATCG ATTGATCGGC ATCTCGATCT GAGCAGCATC
CGGGGGCATT TGGCAGATTT CTATAGCCAC ACGGGGCGTC CATCTGTCGA TCCTGAGCTG
ATGATCCGGA TGCTGTTGGT CGGATACTGT TTTGGCATCC GGTCAGAGCG GCGGCTCTGC
GAAGAGGTGC ATCTGAACCT GGCATACAGA TGGTTCTGCC GCCTTGAACT GACAGACCGC
ATCCCGGACC ATTCGACATT TTCCAAGAAC CGGCACGGCC GCTTCCGTGA CAGTGACCTC
TTGCGTCATG TGTTCGAGGC GACTGTTGCG CGCTGCATTG AAGAGGGTTT GGTCGGCGGC
CAGGGCTTTG CGGTCGATGC CAGCCTGATC AGCGCGGATG TCCAGAAGCA GAACTCGAGC
AATCCCGAAG GCTGGGCGGC CCGCGAGATT GATCCCACGG ATGCGCCCCG CGCGGTGCGG
GAGTATCTCG ACACTTTGGA CGATGAAGCC TTCGGTGCAG CGACAACAGC AAAACCCAAG
TTCACCGCCC ATGCCGATCC GGCCAGTCAA TGGACGGCTG CGCGCAAAGG GCCTGCATTC
TTTGCCTATT CTGACAACTA CCTGATCGAC ACCGATCACG GGATTATCGT TGACGTGGAC
GCCAGCCGGT CGAACAAGAC CGCCGAGGTC GGTGCCATGC GGAAGATGCT CGACCGGACC
GAAGACCGGT TTGGCGTGAA GCCCGATTGG ATCGCTGCTG ACACCGCCTA CGGATCGTCA
GACAACCTGG TCTGGCTGGC ACTCAAGCGC CAGATCCTCC CCTTCATCCC TGTCTTTGAT
AAAGGTGAAC GGACCGACGG AACCTTCTCG CGGTCCGACT TCACGTGGGA TGACGAGAAC
GATCGCTACA TCTGCCCGAG TGGAAAGGAG ATGCGCCACA CATGGCGGAC CTATTCCGAT
CCCGCGCGAA ATGCACCAGC TTGGAAAGCC CGCAGATATC GGACGCGGAA GTCTGATTGC
ACGGGATGTG CGCTGAAGGC CAAATGCTGC CCCAACTCGG AGGTCCGTGC GATCCATCGC
GAGAAATATG AGATCGTCCG AGACTTCGCC CGCCAATGCA CCGCCTCAGA GTACAATCCA
ACTGCCCAGA GGCGGCGAAA GAAAGTAGAG ATGCTCTTTG CCCACCTTAA ACGCATCCTC
GGCCTGGGCC GGCTCCGATT ACGTGGCCCA TGCGGCGTCC AAGACGAGTT TACCCTCGCA
GCCACCGCCC AAAACCTTCG GAAACTAGCA AAACTCAAAC CCATGGTGCC GGCCACAGAA
TGA
 
Protein sequence
MMGPRQEAQA ALFYEFSLEE HVPQDHLLRS IDRHLDLSSI RGHLADFYSH TGRPSVDPEL 
MIRMLLVGYC FGIRSERRLC EEVHLNLAYR WFCRLELTDR IPDHSTFSKN RHGRFRDSDL
LRHVFEATVA RCIEEGLVGG QGFAVDASLI SADVQKQNSS NPEGWAAREI DPTDAPRAVR
EYLDTLDDEA FGAATTAKPK FTAHADPASQ WTAARKGPAF FAYSDNYLID TDHGIIVDVD
ASRSNKTAEV GAMRKMLDRT EDRFGVKPDW IAADTAYGSS DNLVWLALKR QILPFIPVFD
KGERTDGTFS RSDFTWDDEN DRYICPSGKE MRHTWRTYSD PARNAPAWKA RRYRTRKSDC
TGCALKAKCC PNSEVRAIHR EKYEIVRDFA RQCTASEYNP TAQRRRKKVE MLFAHLKRIL
GLGRLRLRGP CGVQDEFTLA ATAQNLRKLA KLKPMVPATE
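
The GC content field can in principle be recomputed from the gene sequence block. The sketch below assumes the sequence is pasted as shown in this record, in space-separated groups of ten bases, and that GC content means the percentage of G and C bases over all bases. The function name `gc_content` is hypothetical and is not taken from the source database.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace (spaces and newlines used for display grouping) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example input: the first row of the gene sequence from this record.
first_row = "ATGATGGGGC CAAGGCAGGA AGCACAGGCG GCACTGTTTT ACGAGTTTTC GCTGGAGGAG"
print(round(gc_content(first_row), 1))
```

Passing the complete gene sequence instead of a single row would give a value to compare against the listed GC content, bearing in mind that the record reports it rounded to a whole percent.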