Gene Information |
Locus tag | Haur_1064 |
Symbol | |
ID | 5732968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1216770 |
End bp | 1217891 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278199 |
Product | extracellular solute-binding protein |
Protein accession | YP_001543840 |
Protein GI | 159897593 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0687] Spermidine/putrescine-binding periplasmic protein |
TIGRFAM ID | |
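The coordinate and length fields in this table can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
start_bp = 1216770
end_bp = 1217891
reported_gene_length = 1122      # bp
reported_protein_length = 373    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(gene_length(start_bp, end_bp))          # 1122
print(protein_length(reported_gene_length))   # 373
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.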
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTAC GTCGCTTTGG GCTGGTGCTC ATGGTGTTGA GTTTGTTGGT GGCCTGTGGC GAAGCCAGCT CAACCGCTGT TCCGACAACT GCGCCTGCTG GTACGGCCAC TGGCTCGAAT GCTGGCTCGG TTGATCGCAG CAAATTGAGC AAAACCTTGC GAATTTATGC GTGGGGCGAG TATGTTCCCG AAGATGTGCC ACAACTGTTT GAGAGTGAGT TTGGGGTCAA GGTTACGGTT GATACCTATT CATCGAACGA AGAAATGGCC GCCAAAATTC GCGCTGGCAA TTCAGGCTAC GATTTGATTC AACCTTCAGA TTATATGGTG GCGTTGTTGG CCGAGGGCAA TTATTTGGCC AAAATTGATT TGGCTAATAT TCCCAACATT GCCAATATCG ACCCAGCCAA TATGGGTTTG TATTACGACC CAAACAATGA ATTTTCAGTA CCATACCTTT GGGGAACCAC GGGGATTGCC TACGATAAAA CCGCAGTTTC ACCAGCCCCA ACTAGCTGGT CGATTTTGTT TGATCCAGCG CAATTGAGCG CCTACAAAGG TCGCGTGAGC ATGCTCAACG ACGAGCGCGA GGTAATTGGC GCGGCTATGC TGTTCTTGGG CAAAGATCGC AATTCCAGCG ATGCGGCGGA TCTCGAAGCC GCCAAAAAGG TTTTGATTGA ACAAAAGCCA TTGCTAGCCA AATATAACAG CGATAATGTT TATCAAGATT TGGCTTCGGG CGAAGTGGTT TTGGCCCAAT CGTGGAATAA TTACACGGGT TTGGCCATGA TCGATAATGA AAACATCGAG TGGGTGATTC CGCAAGAGGG TGGCGTGATT TGGCAAGATA CCATGGCGAT TGTGGCTGGC ACGCCCAACC AATATACTGC CGAAGTATTC ATTGATTTTA TGAATCGGCC AGAAATTGCC GCCAAAGTTG CCGACTTTAC TGGTGCTTTG ACTCCCAACG TCAAGGGCGA ACCATTGATC GGCGACGATC TCAAGGCTGT CTATCCCAAG ATCAAACCGA GCGCTGAAGA TCGCAAACGG CTTGATTGGT TGCGTAAAGG CCAAAATGCG ACGGCCTTCT CCGATGTGTG GTCGGCGGTT AAATCGCAAT AA
|
Protein sequence | MKLRRFGLVL MVLSLLVACG EASSTAVPTT APAGTATGSN AGSVDRSKLS KTLRIYAWGE YVPEDVPQLF ESEFGVKVTV DTYSSNEEMA AKIRAGNSGY DLIQPSDYMV ALLAEGNYLA KIDLANIPNI ANIDPANMGL YYDPNNEFSV PYLWGTTGIA YDKTAVSPAP TSWSILFDPA QLSAYKGRVS MLNDEREVIG AAMLFLGKDR NSSDAADLEA AKKVLIEQKP LLAKYNSDNV YQDLASGEVV LAQSWNNYTG LAMIDNENIE WVIPQEGGVI WQDTMAIVAG TPNQYTAEVF IDFMNRPEIA AKVADFTGAL TPNVKGEPLI GDDLKAVYPK IKPSAEDRKR LDWLRKGQNA TAFSDVWSAV KSQ
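The gene and protein sequences above are related through translation table 11. A minimal sketch of that translation follows, assuming the sequences are pasted in with the 10-character spacing shown here. The helper names (`clean`, `translate`, `gc_content`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in its permitted start codons, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the compact TCAG ordering (third base varies fastest).
# Amino-acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
print(translate("ATGAAACTAC GTCGCTTTGG GCTGGTGCTC"))  # MKLRRFGLVL
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and the reported GC content.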