Gene Dshi_1453 details


Gene Information

Locus tag: Dshi_1453
Symbol: ssuA2
ID: 5712630
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1511962
End bp: 1512945
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 72%
IMG OID: 641267366
Product: putative sulfonate/nitrate transport system substrate-binding protein
Protein accession: YP_001532796
Protein GI: 159044002
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
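
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1511962
END_BP = 1512945
GENE_LENGTH_BP = 984
PROTEIN_LENGTH_AA = 327


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 984
print(coding_codons(GENE_LENGTH_BP))   # 327
```

Under these assumptions both values agree with the listed gene length (984 bp) and protein length (327 aa).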


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.892723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0689078
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCC TTACCCGCCG ATCCACCCTC GCCCTTCTCG GCGCCGCCAC TGGCGCCCTT 
GCCCTGCCCC GCCGCACGGT CGCCGCCCCG ATCCCGCGTC TAGCGCTCTA CGGGCCGCCC
GCCGGTCCGT CGATCACGCT CGCCCATGCG GTCACCGCCG GACTGCTGAC CGACATCGCG
GACGAGACGC GCTTTACCGC CTGGCGCAGC CCCGACGAGT TGCGCGCCGG GCTGACCTCG
GGCGAGATCC TCGCCTCGGT GGTGCCGATC CAGGCGGCGG CGAACCTCTA CAACCGCGGC
TTCCCGATCC GGCTGGCCAA TGCCATGACC AACGGCCTGC TCTATGTCCT CGCCGAAGAT
CCCGGGATCG CGGCGATCCC CGATCTTGCG GGCCGTCACA TCGCCGTGCC CTTCCGCGGC
GACACGCCCG AGATCATTTT CGGCCAGCTT CTCGCCCATT ACGGTCTGGG CCCGGACGAT
CTGCAGATCA CCTATGCCGG TACCCCGACC GAGGCGATGC AGCTGATGCT GGCCGGGCGC
GTCGACGCCG CCCTGACCGC CGAGCCCTCG ACCACGGCGG CGGTGCTGCG GGGGCGCGAG
GCGGGCAAGC AGATCCGGCG CGCGATCAAC CTGCAAAACG CTTGGGGCGA GATGACCGGG
GCCGCCCCCG TCCTGCCGCA GGCGGGACTG GCTCTGACCG GAACCTTCCT CGCGGAGCAT
GGCGAGACGG TGCCTGCGCT TCTGACCGCG CTGGAGCAGG CGACCGCCGA TGTCCTGGCC
AAGCCGCAGG CGGCCGCGGC CCATGCAACG AAGGCCCTCG GCCTGCCAGC GCCGCTTCTG
GCGGCCTCGA TCCCCCATGC GAACCTCGTC GCCCGTCCCG CCACCGAGGC GCGGGCGGAT
ATCGAACGGA TGCTGACGGC CATGGGCGGG ACGGACCTCG CCCGGATCGG CGGCGCCCTG
CCCGACGACG CCTTCTACCT CTGA
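
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout and compute a GC fraction. It assumes the sequence contains only A, C, G and T, and the helper names `clean_sequence` and `gc_fraction` are hypothetical. How the database rounds the GC content field is not stated, so no expected value is given here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCTCCC TTACCCGCCG ATCCACCCTC GCCCTTCTCG GCGCCGCCAC TGGCGCCCTT"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Applying the same two calls to all lines of the gene sequence gives a figure that can be compared with the GC content field.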
 
Protein sequence
MTSLTRRSTL ALLGAATGAL ALPRRTVAAP IPRLALYGPP AGPSITLAHA VTAGLLTDIA 
DETRFTAWRS PDELRAGLTS GEILASVVPI QAAANLYNRG FPIRLANAMT NGLLYVLAED
PGIAAIPDLA GRHIAVPFRG DTPEIIFGQL LAHYGLGPDD LQITYAGTPT EAMQLMLAGR
VDAALTAEPS TTAAVLRGRE AGKQIRRAIN LQNAWGEMTG AAPVLPQAGL ALTGTFLAEH
GETVPALLTA LEQATADVLA KPQAAAAHAT KALGLPAPLL AASIPHANLV ARPATEARAD
IERMLTAMGG TDLARIGGAL PDDAFYL
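
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal pure-Python sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which this sketch does not model), and the `translate` helper is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard codon assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Start and end of the gene sequence, copied from the record above.
print(translate("ATGACCTCCC TTACCCGCCG ATCC"))   # MTSLTRRS
print(translate("CCCGACGACG CCTTCTACCT CTGA"))   # PDDAFYL
```

Both fragments match the first and last residues of the listed protein sequence. The last line of the gene sequence starts in frame because every preceding line holds 60 bases.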