Gene Dshi_0491 details


Gene Information

Locus tag: Dshi_0491
Symbol: ssuA1
ID: 5711407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 476139
End bp: 477122
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 68%
IMG OID: 641266395
Product: putative sulfonate/nitrate transport system substrate-binding protein
Protein accession: YP_001531840
Protein GI: 159043046
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
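
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 476139
end_bp = 477122
gene_length_bp = 984
protein_length_aa = 327

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length counts the stop codon, which encodes no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)            # span matches the listed gene length
print(gene_length_bp % 3 == 0)              # length is a whole number of codons
print(coding_codons == protein_length_aa)   # codons minus stop match protein length
```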


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.00141878
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.493343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTC TTTCCCGCCG ATCCACTCTC GCCCTGCTGG GCACGGCCGC TGGCGCCCTC 
GCCCTGCCCC GCAGCGCCGC GGCACAGCCG ATCCCACGGC TGGCGCTCTA CGGGCCGCCG
GCCGGCCCCT CGATCACCTT GGCCCATGCG GTCAAGACCG GAATGCTGTC CGACATCGCC
GAGGAGACGC TCTTTACCCC ATGGCGCAGC CCCGACGAGT TACGGGCGGG GCTGACTTCG
GGCGAAATCC TTGTGTCCGT GGTGCCGATT CAAGCGGCCG CGAACTTCTA CAATCGCGGC
TTCCCGATCA GGCTGGAAAA CGCGATGACC AATGGCCTGC TCTACATCAT CGCCGAGGAA
ACAGGGATCG CGACGATCCC CGATCTCGCG GGTCGTCACA TCGCGGTGCC GTTCCGCGGC
GATACCCCGG AGATCATTTT CAGCCAACTC CTCGACCATC ATGGGATGCG TGCCGAAGAT
CTGAAAATCA CCTATGCGGG CACGCCCACC GAAGCCATGC AATTGATGCT GGCGGGCCAG
GTCGATGCCG CCCTTACCGC CGAGCCCTCG ACCACCGCGG CGGTGCTGCG CGGGCGCGAG
GCGGGCAAGC AGATCCGTCG CGCGATCAAC CTGCAGGCCG TCTGGGGCGA GATGACCGGG
GCCGCGCCGG TGCTGCCGCA GGCGGGGCTG GCACTGACGC CAACCTTCCT CGACACTTAC
GGTGACGCGG TTCCCGCCCT TCTGGCTGCG CTTGAACAGG CGACCGCCGA CGTTCTGGCC
AACCCCGAGG CAGCCGCGGC CCATGCCACC GAGGCGCTCG GCCTGCCCGC ACCGCTTCTG
GCGGCCTCGA TCCCCAATTC GAACCTGGTC GCCCGTCCGG CCAACGAAGC GCGGGCCGAC
ATCGAACGCC TGCTGGCGGC AATGGCGGGC CCGGATCTCG CTCGCATCGG CGGTGCGATG
CCGGACGACG CCTTCTATCT GTAA
 
Protein sequence
MTFLSRRSTL ALLGTAAGAL ALPRSAAAQP IPRLALYGPP AGPSITLAHA VKTGMLSDIA 
EETLFTPWRS PDELRAGLTS GEILVSVVPI QAAANFYNRG FPIRLENAMT NGLLYIIAEE
TGIATIPDLA GRHIAVPFRG DTPEIIFSQL LDHHGMRAED LKITYAGTPT EAMQLMLAGQ
VDAALTAEPS TTAAVLRGRE AGKQIRRAIN LQAVWGEMTG AAPVLPQAGL ALTPTFLDTY
GDAVPALLAA LEQATADVLA NPEAAAAHAT EALGLPAPLL AASIPNSNLV ARPANEARAD
IERLLAAMAG PDLARIGGAM PDDAFYL
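
The relationship between the gene sequence and the protein sequence can be sketched by translating codons. This is a minimal example, not the database's own method: it assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons) and uses only the first 30 bases of the gene sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as formatted in the record.
fragment = "ATGACTTTTC TTTCCCGCCG ATCCACTCTC"
print(translate(fragment))    # prints MTFLSRRSTL, the first ten residues of the protein
print(gc_fraction(fragment))  # GC fraction of this fragment only, not of the whole gene
```

Passing the full gene sequence to the same functions would give a way to compare the result against the listed protein sequence and GC content.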