Gene Dshi_1374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1374 
Symbol 
ID5712550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1427332 
End bp1428321 
Gene Length990 bp 
Protein Length329 aa 
Translation table11 
GC content65% 
IMG OID641267286 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_001532717 
Protein GI159043923 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.353892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.853253 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCGA CGATGATCGG ATTGGGGATC GCCGCCCTTG TGGGGGCAGG CGCCGCGCAA 
GCCGCGGACG AGGTGAAGCT GCAACTGAAA TGGGTCACCC AGGCCCAGTT CGCAGGCTAT
TACGTGGCGC TCGACAAGGG GTTCTACGAG GAGGAGAACC TCGACGTGAC GATCCTGCCG
GGCGGGCCGG ACATCGCGCC GACACAGGTG CTCGCCGGGG GCGGCGCGGA TGTCACCGTC
GAGTGGATGC CTGCCGCGCT CGCCGCCCGG GAAAAGGGCC TGCCGCTGGT CAATATCGCC
CAGCCGTTCA AATCGTCGGG CATGATGCTG ACCTGCTGGA AGGATGCCGG GATCGAGACG
CCCGCCGACC TGTCCAACCG GACCCTGGGC GTGTGGTTCT TTGGCAACGA GTTTCCGTTC
ATGTCCTGGA TGAGCCAGCT CGGCATCTCC ACCGAGGGCA AGTCCGAAAC CGGGGTCGAG
GTGCTCAAGC AGGGCTTCAA TGTCGACCCG CTCCTGCAAC GGCAGGCGGA CTGTATCTCC
ACCATGACCT ATAACGAATA CTGGCAGGTG ATCGATGCGG GCGTGGACCC GGACGAACTG
GTCACCTTCA AGTACGAGGA CCAGGGCGTC GCGACGCTGG AAGACGGGCT CTACGTGCTG
GAAGAGAATC TCTCCGACCC GGCCTTCGAG GACAAGATGG TGCGCTTCGT GCGGGCCTCT
ATGCGCGGCT GGAAATACGC CGAGGAGAAC CCCGAGGAAG CCGCCGAGAT CGTGCTCGAC
AACGACGCGA CAGGGGCGCA GACCGAAAGC CATCAGAAGC GCATGATGGG CGAGGTCGCG
AAGCTGACCG CGGGCTCCAA CGGGGCGCTG GATCCGGCCG ATTACGAACG CACCGTTGCG
ACGCTGCTGG CGGGCGGATC GGACCCGGTG ATCACCAGGC AGCCCGAAGG GGCCTGGACC
CACGCGATCA CGGATGCGGC GCTGAACTGA
 
Protein sequence
MKSTMIGLGI AALVGAGAAQ AADEVKLQLK WVTQAQFAGY YVALDKGFYE EENLDVTILP 
GGPDIAPTQV LAGGGADVTV EWMPAALAAR EKGLPLVNIA QPFKSSGMML TCWKDAGIET
PADLSNRTLG VWFFGNEFPF MSWMSQLGIS TEGKSETGVE VLKQGFNVDP LLQRQADCIS
TMTYNEYWQV IDAGVDPDEL VTFKYEDQGV ATLEDGLYVL EENLSDPAFE DKMVRFVRAS
MRGWKYAEEN PEEAAEIVLD NDATGAQTES HQKRMMGEVA KLTAGSNGAL DPADYERTVA
TLLAGGSDPV ITRQPEGAWT HAITDAALN