Gene Information |
Locus tag | Haur_2268 |
Symbol | |
ID | 5734155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2901520 |
End bp | 2902761 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279409 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001545036 |
Protein GI | 159898789 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
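The coordinate and length fields above agree with each other, and a short check makes the arithmetic explicit. This is a minimal sketch under two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Haur_2268.
length_bp = gene_length(2901520, 2902761)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1242 413
```

Both results match the Gene Length (1242 bp) and Protein Length (413 aa) fields, which is what one would expect for an unspliced CDS on the + strand.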
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0458822 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCAAC AACGCTGTTG GCTTCGACGC TTAGCTTATT TGATGATCAT CGGTTTGTTA GGCGGTTGTA TCAGCACCAG TGCCAACCAA CAACCGCTGG TGATCACCTT TGGCGCATCG ATCTCGATTA CTGGCAAAAC CGCCAAAGAA GGCGAATATG TGCGTGATGG GTATCAATTT TTTGTTGATA CCCTGAATGC CCAAGGTGGG ATTCTGGTCG GCGGCCAACG CTATCAGTTG CGTTTACGTT ATTATGATGA TGAATCGAAC CTCGAACGCA CAGCTGAGCT GTATGAAAAA TTAATCAATC ACGATCAAGT TGATTTTTTA TTGGGGCCAT ATGGCTCGGA TGCTACCAGC GTTGCGGTAG CGATCGCCGA AAAATATCAT ATTCCATTGG TTTCGGGCCA TGGCTCGGCC AGCAGCATTT ATGCCAATAA CTATCACTAT ATTTTCAGTG TGCAAACGCC CGCCCGCCAC TACTTAAACG GAGTGATGGA TGCAGTATTG GCGGCTGACC CAAGCCTCAA AACGCTGGCC CTGTTGAGCG AAACCGATTC GTTTTCGCAG GATGTCGCCC AAGGTGTGCG TGATTACGCC CAACAGCGCG GCTTAAACGT GGTTTATCAT GGCGATTATC CCAGCGATGC GCGTGATGTG AGTCATCATT TAAATATCAT TAAGCAACTT CAGCCCGATA TGTTGCTCGG TGCAGGTCAT CTGCAAGAGG CTTTGTTAAT TGTCAAGCAA GCCAAAAGCC TCGATCTTAG CCCTAAAGCA ATTGGATTAA GTGTGGGGCC ATTATTGCCG CAATTTCGCG CTAATTTACA ACATGATGCC GATTATATCC TTGGCCCAAC CCAATGGACT CCTGCCCTCG ACTATCATGG CGATGATAGC TGGCAAACCC CAGCGGCTTT TGCCCAAGCC TTTCGTCAGC AATACCCCCA ATATAAATCG GTGCCCTATC AAGTTGCTGA GTCGGCGGCA TCATTGATCG TCTTTCAACG GGCCTTTGAG CGGGCAGGAA CGATCGATCG CTTAGCGGTG CGCGATACAA TTAAAGGCTT AAAACTTGAT ACTTTTTTCG GGCCGATTCA ATTTGACGCG CAGGGCGTAA ACAGCGAAAA GCCCATGGCA GTTGAGCAGT TGCATCCTGA TGGTCAAAAA TATACGGTAT TTCCCCAAGC CGTGGCCGAA CAACCACTGT TGTATCCCAT GCCCACGTGG AGTCAACGCT AG
|
Protein sequence | MWQQRCWLRR LAYLMIIGLL GGCISTSANQ QPLVITFGAS ISITGKTAKE GEYVRDGYQF FVDTLNAQGG ILVGGQRYQL RLRYYDDESN LERTAELYEK LINHDQVDFL LGPYGSDATS VAVAIAEKYH IPLVSGHGSA SSIYANNYHY IFSVQTPARH YLNGVMDAVL AADPSLKTLA LLSETDSFSQ DVAQGVRDYA QQRGLNVVYH GDYPSDARDV SHHLNIIKQL QPDMLLGAGH LQEALLIVKQ AKSLDLSPKA IGLSVGPLLP QFRANLQHDA DYILGPTQWT PALDYHGDDS WQTPAAFAQA FRQQYPQYKS VPYQVAESAA SLIVFQRAFE RAGTIDRLAV RDTIKGLKLD TFFGPIQFDA QGVNSEKPMA VEQLHPDGQK YTVFPQAVAE QPLLYPMPTW SQR
|
| |
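The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative only: it builds the codon table by hand rather than using a bioinformatics library, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only short fragments copied from the record are used as input.

```python
# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; returns 0.0 for an empty sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks and the final twelve bases of the gene sequence above.
print(translate("ATGTGGCAAC AACGCTGTTG GCTTCGACGC"))  # prints: MWQQRCWLRR
print(translate("AGTCAACGCT AG"))                      # prints: SQR
```

The two fragments reproduce the first block and the last three residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would be the natural way to compare against the complete protein sequence and the reported GC content of 49%.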