Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_0491 |
Symbol | ssuA1 |
ID | 5711407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 476139 |
End bp | 477122 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641266395 |
Product | putative sulfonate/nitrate transport system substrate-binding protein |
Protein accession | YP_001531840 |
Protein GI | 159043046 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.00141878 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.493343 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTTC TTTCCCGCCG ATCCACTCTC GCCCTGCTGG GCACGGCCGC TGGCGCCCTC GCCCTGCCCC GCAGCGCCGC GGCACAGCCG ATCCCACGGC TGGCGCTCTA CGGGCCGCCG GCCGGCCCCT CGATCACCTT GGCCCATGCG GTCAAGACCG GAATGCTGTC CGACATCGCC GAGGAGACGC TCTTTACCCC ATGGCGCAGC CCCGACGAGT TACGGGCGGG GCTGACTTCG GGCGAAATCC TTGTGTCCGT GGTGCCGATT CAAGCGGCCG CGAACTTCTA CAATCGCGGC TTCCCGATCA GGCTGGAAAA CGCGATGACC AATGGCCTGC TCTACATCAT CGCCGAGGAA ACAGGGATCG CGACGATCCC CGATCTCGCG GGTCGTCACA TCGCGGTGCC GTTCCGCGGC GATACCCCGG AGATCATTTT CAGCCAACTC CTCGACCATC ATGGGATGCG TGCCGAAGAT CTGAAAATCA CCTATGCGGG CACGCCCACC GAAGCCATGC AATTGATGCT GGCGGGCCAG GTCGATGCCG CCCTTACCGC CGAGCCCTCG ACCACCGCGG CGGTGCTGCG CGGGCGCGAG GCGGGCAAGC AGATCCGTCG CGCGATCAAC CTGCAGGCCG TCTGGGGCGA GATGACCGGG GCCGCGCCGG TGCTGCCGCA GGCGGGGCTG GCACTGACGC CAACCTTCCT CGACACTTAC GGTGACGCGG TTCCCGCCCT TCTGGCTGCG CTTGAACAGG CGACCGCCGA CGTTCTGGCC AACCCCGAGG CAGCCGCGGC CCATGCCACC GAGGCGCTCG GCCTGCCCGC ACCGCTTCTG GCGGCCTCGA TCCCCAATTC GAACCTGGTC GCCCGTCCGG CCAACGAAGC GCGGGCCGAC ATCGAACGCC TGCTGGCGGC AATGGCGGGC CCGGATCTCG CTCGCATCGG CGGTGCGATG CCGGACGACG CCTTCTATCT GTAA
|
Protein sequence | MTFLSRRSTL ALLGTAAGAL ALPRSAAAQP IPRLALYGPP AGPSITLAHA VKTGMLSDIA EETLFTPWRS PDELRAGLTS GEILVSVVPI QAAANFYNRG FPIRLENAMT NGLLYIIAEE TGIATIPDLA GRHIAVPFRG DTPEIIFSQL LDHHGMRAED LKITYAGTPT EAMQLMLAGQ VDAALTAEPS TTAAVLRGRE AGKQIRRAIN LQAVWGEMTG AAPVLPQAGL ALTPTFLDTY GDAVPALLAA LEQATADVLA NPEAAAAHAT EALGLPAPLL AASIPNSNLV ARPANEARAD IERLLAAMAG PDLARIGGAM PDDAFYL
|
| |