Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1269 |
Symbol | |
ID | 5733162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1480413 |
End bp | 1481576 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641278409 |
Product | hypothetical protein |
Protein accession | YP_001544045 |
Protein GI | 159897798 |
COG category | [S] Function unknown |
COG ID | [COG3287] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC ACGCAGCTAC CGGTGCATCG CAGATTGAGC ACAGCTATGA TGCTGGTTTC GCCGCCGCCC AGCAAGCCTG TACGCAGCTT GCCCCCCATT CGCCAACCTG TCTGATCGCC TTTACCACTG ATGCCTATGA TCAGGCCGCC GTTGTTCAAG GCATTCGTGC AGCAAGCCAA CAAGCGCCAT TAATCGGCTG TTGTGCTGGC GGCATTATTA GCAATGCCGG CACTTTTACC CATGGTGTGG TGCTACTCGC ACTGGCTTCC GATGATCTCA AGCTGGATTT GAGCCTTGTC GCAGGGGTCA AAGCCGATCC GGGCAAGGTT GCCGATCAGT TGGCTGATCA ATTAGAGGCT GTTTTAGATA ACCCCACTAG TCAACAAGCT GCCTTGATCT TGGTCGATGG TTTGGCTGGC ACCTTGACCG ATTTTGTGCA GCATGCCACC GCTGCTTTTG GCCCGTTATG CCCGCTGGTT GGTGGCGGCG CTGGCGATAG CTTCCAATTC AAACAAACCT ATCAATTTGT CAACGATCAG GTCATTAGCG ATGGTGCGGC AGTCGGCCTA TTGCAATCGC CAACTCCGAT GGGCATTGGG GTACAACATG GCTGGGAGCC TGCTGCAAGA GGTTTGGTGG TTACACGCAG CGAAGGCACA ATTATTTACG AGCTTGATGG ACGGCCTGCC TTTGCAGTCT ATCAAGAGCT TTTCCCCGAT TTGACAGTGG AAAATTTCGG GCGCTTTGTG ATCGATCATC CGATTGGCTT GCCCCAAATT AATGGCGAAT TTTTGATTCG CGACCCGTTG CGCACCCATC CTGATGGCTC GATCGAATGC ATTGCCAGCG TGCCCAAAAA TGTGGTCGCG CATATTATGC ATGGCTCGCA TGAAACCTTG TTCAACGCTG CTCAACTGGC CACGAAACGG GCCTTGGCTG CGCTAAATGG GCCACCAGCA GCATTAATTA TTTTCGATTG TGTTTCGCGT CTGGCAATGT TGGGCGATGC AGCCGCCACC GAAGTGCAAC GCATTCGCGA AGTTGCTGGC TTGGATGTGC CAGTTGTCGG TATGTTTAGC TTTGGCGAAA TTGCTGCTGC GGAAACAGGC GGAGCGCTGT TTCATAACAA AACCGTTGTT GTGTACGCTA TTGGTCAGGC CTGA
|
Protein sequence | MTTHAATGAS QIEHSYDAGF AAAQQACTQL APHSPTCLIA FTTDAYDQAA VVQGIRAASQ QAPLIGCCAG GIISNAGTFT HGVVLLALAS DDLKLDLSLV AGVKADPGKV ADQLADQLEA VLDNPTSQQA ALILVDGLAG TLTDFVQHAT AAFGPLCPLV GGGAGDSFQF KQTYQFVNDQ VISDGAAVGL LQSPTPMGIG VQHGWEPAAR GLVVTRSEGT IIYELDGRPA FAVYQELFPD LTVENFGRFV IDHPIGLPQI NGEFLIRDPL RTHPDGSIEC IASVPKNVVA HIMHGSHETL FNAAQLATKR ALAALNGPPA ALIIFDCVSR LAMLGDAAAT EVQRIREVAG LDVPVVGMFS FGEIAAAETG GALFHNKTVV VYAIGQA
|
| |