Gene Dshi_2138 details


Gene Information

Locus tag: Dshi_2138
Symbol:
ID: 5713134
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2267574
End bp: 2268803
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 70%
IMG OID: 641268060
Product: hypothetical protein
Protein accession: YP_001533475
Protein GI: 159044681
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0739] Membrane proteins related to metalloendopeptidases
TIGRFAM ID:
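
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2267574
end_bp = 2268803
gene_length_bp = 1230
protein_length_aa = 409

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS includes a stop codon, which is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 1230 0 409
```

Under these assumptions the span matches the reported gene length, is a whole number of codons, and yields the reported protein length.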


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.123403
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.331255
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCAGCG ACCCGAAAAG AGGCGCGCCA CGGCCCGACA AGAGCAGAAG GCTTGAGCCC 
ATGACCGACA TCTCCCAGCC TCCCAAGTGC GGCGCGCGGC GCACCGCCTC GGTATTGGCG
ATCTGTAGCG CCCTGGCCCT GGCGGGCTGC GACACCTCGA CCATGGATTT CGACATGCGG
GACGCATTCG GCGGCCCGTT CTCGACTGCC GGCGCAGTGG GGGCGCCCGC CGCCCCACGG
CCCGAGCCCG ATGAGCGCGG GGTGATTTCC TATGCCGACT ATCAGGTGGT CCTGTCGCGG
CAGGGCGACA CCGCCAACAC CATCGCCGCG CGGCTCGGGG TCGACCCGGG CGCGCTGGCA
CGGTTCAACG GGCTGAACCC GGAAACCGTG CTGCGAGAGG GCGAGGTTCT GGCCCTGCCG
ACCCGGGTGG CAGAGCCTGT GGGCACCACG GATATCGCGG TCCTCGCTTC GGGCGCGATC
GAACGGGCGG AGCTGCCCGA AGGCACCAGC AGCGTAGACA CCCCGCTGGT GCGCGCCGTG
CCCTCGGCGG AACTGTCCGG CCCGACACCA ATGCAACACA AGGTGCAGCG CGGGGAGACA
GCCTATTCCA TCGCGCGGCT CTACGACATC TCGGTGCGAG CTTTGGCGGA CTGGAACGGA
CTGGGTCCGG ATCTCGCGGT GCGCGAGGGA CAAGTTCTGC TGGTGCCGCT GACCACAGCG
GCTGCGGCCC CTGCCGCCGC CCCCGAGCAG CCCGCGCCCG GGCAGGGTAG CGTCGCGCCG
GTCCCACCGA GTGCCAGTAC CCCCCTGCCC GAGCCGGAAC CCGTGGCCGC CGTGCCCGAC
GCGCCCGACA TGGGGCAGTT CCAGACCGAG GCCTCGGACA GTGCGAGCTT CGCCTTCCCG
GTCGCGGGCC GGATCATCCG CGATTACGAC AAGGGCCGGA ACGACGGCAT CGGCATCGCT
GCGGACCCGG GCACGCCCGT GGTGGCGGCG GGCGACGGCG AGGTCGCCGC GATCACCCGT
GATACGGACC AGGTGCCGAT CCTGGTGCTG CGGCACCCGG ACAACCTGCT GACGGTCTAC
GCCAATGTGG GAGATATCGC CGTGGAGAAG GGCGATACCG TGCGTCGGGG ACAGCAGGTC
GCGACCGTGG CCACGGGCGA TCCGTCCTTC CTGCATTTCG AGATCCGCGA AGGGATCGAA
AGCGTCGACC CGGTGCCCTA CCTGAATTGA
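
The GC content field can be recomputed from the gene sequence as displayed. This is a minimal sketch: the helper name `gc_content` is hypothetical, and how the record rounds its 70% figure is not stated, so a recomputed value may differ slightly from it.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to display the sequence in blocks of 10.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First displayed line of the gene sequence; paste the full sequence to check the whole gene.
first_line = "GTGCGCAGCG ACCCGAAAAG AGGCGCGCCA CGGCCCGACA AGAGCAGAAG GCTTGAGCCC"
print(round(gc_content(first_line), 1))
```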
 
Protein sequence
MRSDPKRGAP RPDKSRRLEP MTDISQPPKC GARRTASVLA ICSALALAGC DTSTMDFDMR 
DAFGGPFSTA GAVGAPAAPR PEPDERGVIS YADYQVVLSR QGDTANTIAA RLGVDPGALA
RFNGLNPETV LREGEVLALP TRVAEPVGTT DIAVLASGAI ERAELPEGTS SVDTPLVRAV
PSAELSGPTP MQHKVQRGET AYSIARLYDI SVRALADWNG LGPDLAVREG QVLLVPLTTA
AAAPAAAPEQ PAPGQGSVAP VPPSASTPLP EPEPVAAVPD APDMGQFQTE ASDSASFAFP
VAGRIIRDYD KGRNDGIGIA ADPGTPVVAA GDGEVAAITR DTDQVPILVL RHPDNLLTVY
ANVGDIAVEK GDTVRRGQQV ATVATGDPSF LHFEIREGIE SVDPVPYLN
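
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the Gene Information table. The sketch below is one way to do that with the standard library only. The function name `translate_cds` is hypothetical. It assumes that the input is the coding strand as displayed and that an alternative start codon such as GTG is read as methionine at the first position.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 uses the
# same codon assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons for table 11; each is read as methionine at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # remove display spacing and line breaks
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate_cds("GTGCGCAGCG ACCCGAAAAG AGGCGCGCCA"))  # MRSDPKRGAP
```

The output matches the first ten residues of the protein sequence, including the leading M that comes from the GTG start codon.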