Gene Information |
Locus tag | Haur_4035 |
Symbol | |
ID | 5735897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5150931 |
End bp | 5151827 |
Gene Length | 897 bp |
Protein Length | 298 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281186 |
Product | binding-protein-dependent transport systems inner membrane component |
Protein accession | YP_001546795 |
Protein GI | 159900548 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0395] ABC-type sugar transport system, permease component |
TIGRFAM ID | |
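The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this explicitly, but it is the convention that reproduces the listed gene length). The variable names are illustrative only.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on both ends.
start_bp = 5150931
end_bp = 5151827

# Both endpoints are counted, hence the +1.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 897 bp
print(protein_length, "aa")   # 298 aa
```

Both results match the Gene Length and Protein Length fields of the record.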
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCTCTA ATTCGCTTGT GAGTGAACCA TCAATGGCGC AACCCAAAGC CTCACCAAAG GCACTACGCC CGAGCCGCTG GCTCACACCA TTGCTGGTGC ATGCGGTGTT GGGCAGCTAT ACCCTATTGG CGATTGCGCC GGTGGTTTTG GTGGTGATGA ATTCATTCAA AGGCTCGCAA GCAATTTTTC AACAACCCTA CTCGCTGCCA GTTGGCGCAA ATTTTGATCC GGTTGGCTAT ACCACCGTCT TTGAACAAAC GGCAATCTTT CGCTATCTCG GCAATAGCTT GATGGTGACG CTTGGCTCAA TTTTGCTGAT TTTGCTGTTT GGTGCAATGA CTGCCTTTGG TTTGACCGAA TATCGTTTTC GCGGGCGCGG CTTTTTGACC TTATATGCCC TGATTGGCTT GATGATTCCG ATTCGGTTGG CAACCGTGAG TATTCTCAAA TTGATGGTTA CCCTGAATCT GCAAGATACG ATCTGGTCAT TGATTATGGT GTATAGCGCT CAAGGCTTGC CGATGGCAAT TTTTGTGCTT TCGCAATATA TGCGCCAAGT GCCAACCGAC CTCAAGGATG CGGCGCGGCT GGATGGAGCC AACGAATATC AAGTATTTTG GTTGGTTCTG CCCTTGATGC GCCCAGCCCT CGCGACCTTG GCAATTTTCG TGATGCTGCC AATTTGGAAT GATCTGTGGT TTCCCTTGGT TTTAGCTCCT GGTGAGGCTT CACGCACGCT CACGCTAGGC GCACAACAAT TCCTTGGTCA ATTCCAAACC GATTGGAGCG CCTTGCTAGC GATGCTGACC TTGGCAATTG CCCCGGTTTT AGTGCTCTAC CTCATTTTCT CGCGCCAACT GATTCGTGGC CTCACTGCTG GTACGGTCAA GGGGTAA
Protein sequence | MSSNSLVSEP SMAQPKASPK ALRPSRWLTP LLVHAVLGSY TLLAIAPVVL VVMNSFKGSQ AIFQQPYSLP VGANFDPVGY TTVFEQTAIF RYLGNSLMVT LGSILLILLF GAMTAFGLTE YRFRGRGFLT LYALIGLMIP IRLATVSILK LMVTLNLQDT IWSLIMVYSA QGLPMAIFVL SQYMRQVPTD LKDAARLDGA NEYQVFWLVL PLMRPALATL AIFVMLPIWN DLWFPLVLAP GEASRTLTLG AQQFLGQFQT DWSALLAMLT LAIAPVLVLY LIFSRQLIRG LTAGTVKG
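The gene and protein sequences are related through translation table 11, which uses the same codon-to-amino-acid assignments as the standard code and differs mainly in the permitted start codons. The sketch below is one hypothetical way to spot-check that relationship and to compute a GC fraction. The helper names (`clean`, `translate`, `gc_fraction`) are assumptions for illustration and are not part of the source database. Only the first 15 and last 18 bases of the gene sequence are used here; the same functions accept the full sequence pasted in with its spaces.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Drop the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 15 and last 18 bases of the gene sequence in this record.
head = "ATGAGCTCTA ATTCG"
tail = "GGTACGGTCA AGGGGTAA"

print(translate(head))  # MSSNS, the start of the protein sequence
print(translate(tail))  # GTVKG, the end of the protein sequence
```

Calling `gc_fraction` on the full gene sequence and rounding to a whole percent gives a value that can be compared with the GC content field.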