Gene Information |
Locus tag | Haur_3424 |
Symbol | |
ID | 5735285 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4313483 |
End bp | 4314694 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641280571 |
Product | hypothetical protein |
Protein accession | YP_001546188 |
Protein GI | 159899941 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
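
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4313483
END_BP = 4314694
GENE_LENGTH_BP = 1212
PROTEIN_LENGTH_AA = 403


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks hold for this record: 4314694 - 4313483 + 1 = 1212 and 1212 / 3 - 1 = 403.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

A mismatch in either check would usually point to a partial gene, a frameshift, or a different coordinate convention rather than a problem with the arithmetic.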
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000157289 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGTTC AGCCAACTAG TCCGATTCGT TTCTCACCTC GCGCAAAGTT TGTAACAGTC CTTTCGATTG TTATTATTAG CCTGTACTTG GTCAACCAAG CTTGGCATAT TATGATCCCC TTTATTTGGG CGATCATCAC GGCCTATATT TTTAATCCAG TTGTTACCTG GATGCAACGG CGTACCAAAA CCCCCCGCTT CTGGTGGGTG ATTGTGCTGT ATATCATTGG TTTTACGCTG TTAACTTTGC TTGTGAGTAA CGTCGTGCCA CTGATTACCT CGCAGATTAA CGATTTGGCC CAAACCATTC CTATATGGCT TGATGATGCT AAAATCTATG TTGAGCAGCA TCAAGCGTTG ACCATTCGTG GGGTTGAAAT TGTTAATCTG CGTGAAGCTG AACGCGAAAT CTCGGGGGCA ATTGCGGCGT TTGCCAAAGC CCTGCCTGGT ATCGCCCCGC TGTTTCTGTT TGGGCTAGCC GAAGGCTTTA TTTTGCTTTT GGTTTATTTC GTCGTAACGT TTTATTTATT GCTCCAAGCC GAAACCATCC CCAACAATTT CTATCGCTTA ATTCCCGAAC GCCATCGTGA TGAAATTCGT CAATTGATTG GTTCGATCGA TCGGGTGCTC TCAGCCTACA TTCGCGGTCA GTTTTTGCTG ATTATCATTA TGTCGGTGTT GAGTTACATT GCCCTATCAA TTTTGCAAGT CAAATATGCC TTGATTATCG CCATCGCCAC CGGCTTCTTG GAAATTATCC CAATCATCGG GCCGTACATG GCCACCGCCA TCGCCGCTTT GGTAGGCTTT TTCCAAGGCA CAACCCCATT TGGCTGGGAG CCTTGGGTAT TAGCGTTGGT AATTATTGCC GTCTATTTTG CCCTGCGCCA ATTCGAAGAT CACTTTATTA TTCCAAATTT GGTCGGACAT ATCGTCAATG TGCATCCGGT CTTTGTGATC TTTGCAATTT TGGCGGGGGG AGCGGTAGCC GGCGGCTTAG GTTTACTCAT ATCGATCCCG ATTGCCGCAG TTGCTCGCAT CATTCTTTCA TATCTGCATG GCAAGCTTGT CGATTCGCCC AACCCGCAGG CCGATTTGGT CGAACAAGAG GGCAAATTGC TTGAAAACCA TATAGTGGCT GAGCGTCGCG AAAGTCGGCG CAACCGCCGC GAAGCCAAAA AACAGGTCGA AGGCAGCGAT CTCAAACCCT AA
|
Protein sequence | MHVQPTSPIR FSPRAKFVTV LSIVIISLYL VNQAWHIMIP FIWAIITAYI FNPVVTWMQR RTKTPRFWWV IVLYIIGFTL LTLLVSNVVP LITSQINDLA QTIPIWLDDA KIYVEQHQAL TIRGVEIVNL REAEREISGA IAAFAKALPG IAPLFLFGLA EGFILLLVYF VVTFYLLLQA ETIPNNFYRL IPERHRDEIR QLIGSIDRVL SAYIRGQFLL IIIMSVLSYI ALSILQVKYA LIIAIATGFL EIIPIIGPYM ATAIAALVGF FQGTTPFGWE PWVLALVIIA VYFALRQFED HFIIPNLVGH IVNVHPVFVI FAILAGGAVA GGLGLLISIP IAAVARIILS YLHGKLVDSP NPQADLVEQE GKLLENHIVA ERRESRRNRR EAKKQVEGSD LKP
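
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch for working with them follows: it strips the spacing, computes a GC fraction, and translates the open reading frame. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in which start codons are permitted), which is sufficient here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are illustrative, and only the first 18 bases of the gene are used as example input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases of the gene sequence, exactly as printed (with block spacing).
prefix = clean("ATGCACGTTC AGCCAACT")
print(translate(prefix))  # MHVQPT, matching the start of the protein sequence
```

Passing the full gene sequence through `clean` and `translate` should give a string that can be compared with the cleaned protein sequence, and `gc_fraction` can be compared with the reported GC content.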