Gene Information |
Locus tag | Haur_3055 |
Symbol | |
ID | 5734927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3859317 |
End bp | 3861164 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280199 |
Product | hypothetical protein |
Protein accession | YP_001545821 |
Protein GI | 159899574 |
COG category | |
COG ID | |
TIGRFAM ID | |
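The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record does not state either convention, but both agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3859317
end_bp = 3861164

# Assumption: coordinates are 1-based and inclusive, so the span length
# is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the
# protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1848
print(protein_length)   # 615
```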
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCTA GGTCGATTAA TCGCCGGCTA TTGGTGCTTC AAGCAGGCTG GACAGTTGCT CAGGCCCAGT TATTGCTCGC CCATAGCCAA GCCGAATATG TGGTTGTTCA ACGCACTGAG CCACAAATCT ACTGGTATGT CTACCCTCTC GAAGTTGTCC AAGAACGCTT AAGCTACCAT CACCCTGATA TTGCGATCTA CCTGGCGCTC AATTTGCAAG AAACCTCAGC TAGTTCGACT CTCAACAGCA ATCAACTCGA AACCGCCGAC TATTTAAGCG TGGTGCTTGA TGATCAGCAA CACTTGCAAG GCGTAGTTAA TCCGAGTGCT CAAGCCAAAG GCACAGAATT TGATCCATTC TTCAAGGCCT ATCCCTCAGT AGTTGCCCAA AACCAAGCCC AACTTAACCA AGCCTTTGAT CTAGCGGTTG GCTTTCGCGA TACGCCTGAT GCTGGCTTAA TCGGTGGCCA TAACCCGATT GTCATTCATG GCTTGCAAAC TGATGAACGT TGCACAATTA TGCTCAGCGG CGATGGCTTG CAGTTTGATC GAGAGCAAGC CGAATTGGCC TTTGACATTC AGGCAACCGT ATTTTTTAAG GCCACGCCAA CCCGCACAGG TCGCTGTGTA ATCTATGTCG ATTATTATCG CCAACGCCAA TTGGTGGGCC ATGCCGAGCG GGTTGTGCTG GTCGATAGCA ACGCTGAACC AGAACCTAGC AATGCTAGCC CGTTTGATTT TGGCTCAACT CCGGTCGATT TACTGATCAA CCTGCGCCGT GATGGCGATA CGTTCAAATG GACGGCAATG CCGCATGATC AAGCCTTTAC ACCGGTGCAC AATTTGCCGA GCCAACAAGC CTTATCCGAG CAGGCTGCCC AAAATTGTGC CGTTGATCTG TTGGGTGCGG CGGTTAATCC AAGTTTATTG TTGGCGCAAC GCGAACTTGA AGCGCTTGCC AGCGATCTAG GCCAATTTGT CCCAAGCCCA ATTTGGCAAT TACACAGCGA TTTGGCCCAG AAATTGCAGC GTCCACTCAC GGTTTTGCTG CGCAGCAACG ATTTGGCTTT GCCTTGGGAA TTAGCGATGG TCGAAGCCCC TTTGCTAGCT GGCGATCAGC CGCTGTATTG GGCCGCCCAA ACCCATTTTG CCCGTTGGTA TATTCACCCC CAAGTTAGCC CAATGCCACC CGATCAACTC AACATTAGCC AAATTAGCGC AATTGCCTCA CGCTATGGCT GGGATTCAGG CCAAGCTGAA TTGGTGCATG CAGTTGATGA GCAAACCATG CTGCAAAACC AATGGCAAGC CCAAGCCTAC GAAGCCACGA TTCAGGCGCT TGATCCATTG TTGAGCCAAG CCACGACCCA AGCTGGCCAT CTTTTACACT TTGCCGTGCA TGGCCGCAGC CAACCCAACG CCCGCATTCA AGAAATTATC TTGGCTGATA ATAATGCGAT TTCGGCCAAA GCCTTGGTGG GCAATACTCG CCGTCGCCCG CCCCAATTTA GCTTTGTGTT TATCAATGCC TGCCAAGTTG CGACCCCAGG CCAGAGCTTA GGCCAAGCGG CGGGCTTCCC CGCCGAAATT CTCAAAAGTG GTGCGGCGGG CTTTGTTGCA CCATTGTGGG AAGCTGATGA TCAAGCAGCC GGAACGTTTG CCGCTCAATT TTATAGCCAA GCGTTTCAAG CCCAACCGTT GGGCGCAATT TTGCAACAAT ATCGCCTAAG TTATGTGGCC AATAGCACCA CCACCCGCCT TGCCTATATC TTTTATGGCC ATCCCGCCTT GCGTTTGGCC 
TATTCGAGCA AAGGAGCAAC CCATGCCCAA CAACCAAGTG CGGCTTGA
|
Protein sequence | MSSRSINRRL LVLQAGWTVA QAQLLLAHSQ AEYVVVQRTE PQIYWYVYPL EVVQERLSYH HPDIAIYLAL NLQETSASST LNSNQLETAD YLSVVLDDQQ HLQGVVNPSA QAKGTEFDPF FKAYPSVVAQ NQAQLNQAFD LAVGFRDTPD AGLIGGHNPI VIHGLQTDER CTIMLSGDGL QFDREQAELA FDIQATVFFK ATPTRTGRCV IYVDYYRQRQ LVGHAERVVL VDSNAEPEPS NASPFDFGST PVDLLINLRR DGDTFKWTAM PHDQAFTPVH NLPSQQALSE QAAQNCAVDL LGAAVNPSLL LAQRELEALA SDLGQFVPSP IWQLHSDLAQ KLQRPLTVLL RSNDLALPWE LAMVEAPLLA GDQPLYWAAQ THFARWYIHP QVSPMPPDQL NISQISAIAS RYGWDSGQAE LVHAVDEQTM LQNQWQAQAY EATIQALDPL LSQATTQAGH LLHFAVHGRS QPNARIQEII LADNNAISAK ALVGNTRRRP PQFSFVFINA CQVATPGQSL GQAAGFPAEI LKSGAAGFVA PLWEADDQAA GTFAAQFYSQ AFQAQPLGAI LQQYRLSYVA NSTTTRLAYI FYGHPALRLA YSSKGATHAQ QPSAA
|
| |
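The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: `translate` and `CODON_TABLE` are hypothetical names, and the table uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs only in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). Only the first 30 and last 48 bases from the record are translated as a spot check.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Assumption: standard amino acid assignments, which table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 48 bases of the gene sequence in the record.
first = translate("ATGTCGTCTA GGTCGATTAA TCGCCGGCTA")
last = translate("TATTCGAGCA AAGGAGCAAC CCATGCCCAA CAACCAAGTG CGGCTTGA")

print(first)  # MSSRSINRRL
print(last)   # YSSKGATHAQQPSAA
```

Both fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 615 residues.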