Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4683 |
Symbol | |
ID | 5736530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5981095 |
End bp | 5982789 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281847 |
Product | extracellular solute-binding protein |
Protein accession | YP_001547442 |
Protein GI | 159901195 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGAC GTATTCGTTG GCAAATTGGA GCCACGTTGG TTTGTGCTAG TCTGGTTTTG TTGTTGTTGG CGCATCTTGC GCTAGCAACA ACAGCAGGCG GCGAGGCAGG CGTTGGCGGG GTCTATCGCG AGGCCTTGGT TGGCGAGGTC AATGATCTTA ACCCATTGCG CGGTAGTGCC CAAACCCGCG CCGAGGCCGA CTTAAGTGCT TTGTTGTTTG AAGGCTTGAC CCGCGTTGAG GCAACTGGCG TGGTCGTGCC CGATCTCGCC GAAAATTGGC AGGTCAGCAG CAATAATCTG GTTTATACCA TGACCTTGCG CACTGATGTG CGCTGGCACG ATGCCACACC TTTTTCTGCA AACGATGTGA TCTGGACGAT CGACTGGATC AAAAGCCCAC AATTTAATGG CGACCCCAGC CTTAAGTTAG CTTGGCAAAA TATCGCCGTC ACCGCGTTGA ATCCGCAAAC TGTGCGTTTT AACTTGCCAG TTGCTTTTGC GCCCTTCCTC AACCAACTAA CTGTGCCAAT TTTGCCCGCG CATCTGTTGC GCAACGCTAC GCCCGACCAA TGGCAAGCTT GGAATCAGCA ACCAGTTGGC ACTGGCAGTT TTCGCTTTGG TAAACGCGAT GGCAACTTAA TCGAGCTATT GGCCAACACT GATTACCGTC CGAATATTCC CAATAGTCGC CCCAATTTGG AGCGGCTGGA GTTTCGGCTT TACGCAACCA TGGAAGCTGC GCATGAAGCA CTGCGCCGTG ATTTGGTCGA TGCGTTTGCC TACGATGTGA CTGAACAAGG CGAATTACCG TTGCCAATCG GCTATCGCCA GGTGCGTGCG CCTTTGGCTG ATTATGTGGT CCTGACGATT AATTTGCGTC GTCCGCCACT TGATGATTTA CGGGTGCGCC AAGCGATCGA TTATGCAGTT GACCCAACTA CGTTGATTAC TGAAGTGTTG CAAAATCGGG CGTTGCCCTT GAATTCGCCA ATTTTGCCTT CGTCGTGGGC GGTTGCTGAG GGTCTCACCG GCTATATTGA TGATGCTGAG CAGAGTCGCG CCAATGGGTT GCTTGATCAA GCTGGTTGGC AGCGTGATGC CGAAGGGTGG CGGCGTAAAG CTGGCCAATT GTTGCGCTTC GATTTGGTCA GTGCCAACAG CCCCGAACGC AGCGCCATCG CCGAACGCAT TCAAGCCCAA TTAGCCACGG TCGGTATTTC AAGCAGTTTG CAATTGGTTC CGAGCAGCAA TTTGCAAGAT ACGCTGAGCC AACATACCTT CGATTTAGCA ATTCATGGTT GGTCGAATTT AGGCAGCGAC CCCGATGTTT ACGAGTTATG GCATTCGTCG CAGGTCAACG CCGCCAACTT TGCTGGCCTC GAAGACCCAG CAATTGATGA TTTATTGGTG CGTGCTCGCC AAACCACCAG CCAAGAAACC CGCCGCGAAT TGTATGCTGA ATTTCAGGCC AAATGGCTAG CACTCGCGCC CTCAGTGATG CTCTATCAGC CGTTGTTGGA GCAGCAGGTT GGACCATCGA TCGAGGTGCT TGGTCTGGCT CCAATCAACC AACAAGCTGA AGTGCTCTAT CGGCATAGCG ATCGCTTCCG GCGGATCGCC AATTGGTATC TAGTATCCAC CCGTCAAGTC TTGCCCAACT TCCGAAATGA ACCACAAAAC GAACGCCCAC GCTAA
|
Protein sequence | MVRRIRWQIG ATLVCASLVL LLLAHLALAT TAGGEAGVGG VYREALVGEV NDLNPLRGSA QTRAEADLSA LLFEGLTRVE ATGVVVPDLA ENWQVSSNNL VYTMTLRTDV RWHDATPFSA NDVIWTIDWI KSPQFNGDPS LKLAWQNIAV TALNPQTVRF NLPVAFAPFL NQLTVPILPA HLLRNATPDQ WQAWNQQPVG TGSFRFGKRD GNLIELLANT DYRPNIPNSR PNLERLEFRL YATMEAAHEA LRRDLVDAFA YDVTEQGELP LPIGYRQVRA PLADYVVLTI NLRRPPLDDL RVRQAIDYAV DPTTLITEVL QNRALPLNSP ILPSSWAVAE GLTGYIDDAE QSRANGLLDQ AGWQRDAEGW RRKAGQLLRF DLVSANSPER SAIAERIQAQ LATVGISSSL QLVPSSNLQD TLSQHTFDLA IHGWSNLGSD PDVYELWHSS QVNAANFAGL EDPAIDDLLV RARQTTSQET RRELYAEFQA KWLALAPSVM LYQPLLEQQV GPSIEVLGLA PINQQAEVLY RHSDRFRRIA NWYLVSTRQV LPNFRNEPQN ERPR
|
| |