Gene Information |
Locus tag | Haur_2935 |
Symbol | |
ID | 5734807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3710278 |
End bp | 3711303 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280079 |
Product | hypothetical protein |
Protein accession | YP_001545701 |
Protein GI | 159899454 |
COG category | |
COG ID | |
TIGRFAM ID | |
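The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS is complete and ends in a stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table in this record.
START_BP = 3710278
END_BP = 3711303


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len, protein_length_aa(gene_len))  # prints: 1026 341
```

Both values agree with the Gene Length (1026 bp) and Protein Length (341 aa) fields listed above.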
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000152042 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATTTGA AGAAATGGTC GTTGATTGGA GCTGCATTGG TTGGTTTATT GCTCGCTCAG CCAATTACCC CAAGCACAGC TAGCCCACAA GCTTTGGATA TCGCTACTAG AGGATCGATT AGTTGTGCCA GCCCGACGAC TGATCGAATC GATTGCTTTG TGGTGGGCGA TGATAAACAA GTTTGGCAAC ATTCATGGCA AAGCGGTTTT GGTTGGGGCA CTTGGGTTGG TTTGGGCTTT CCCTATTTCA CAATCGATCC AGGCGGGGGT GTTTCGGCGG TTGCGCGTGA TGGCACGACC ATTGATATTT TTGTGAGTGG CAAGGATGGT AATTTTTTCA ATCTATACCA ACGCACCTGG AATGGCACAA CTTGGAGCGA TTGGACGAGC TTAGGCGGGC CAGGCTCGTG GGATATTTAC GAATTGAGCT GTACCTCAGC TTCGGCGACC AACATTAGCT GTTTTGTGCG AGCGAGCGAC AACCACCTTT GGCAAAAAAC CTGGAATGGC AGCAATTGGT CAACGTGGAG CGATCTTGGC ACATTTGTTA ATGGCGGCCC ATTTAAAGGT CTCGCTTCAA GTAATTATAA CGCTGATAGT ATTCTGCTTT TTGCAATCGG CAGTGATGGT CATGCCTATC GTCGTTTTTA CGAAAGCGGC ACATGGAGTG GCTGGCTCGA TGATGGCGGG CCAACTGGCA GTAGCTTCGA TCAAGTAAAT TGCCAGGATC AAGCAGGCAG CAAGATCACA TGTGCCTTTA CTGATACCAT GGGGGACGTT TGGTATCGCA CATGGGATGC AGGTTGGAGC GCATTCACGC AGCTAGCAAC CCCGCCATCA AGTGCTAAGG GGGTTGCGGT AGCCCATTAT TCTAGCTTGG GAAGGGTGAT CGTTACCAAT GGCTTTGATG GTAGGCTTTA TCGAACATTT CAGCCAGACC CAACCAGCGC TTGGGCTAGT TGGCTCGACG ATGGTCGCCC GCAAGATCTT CAAATCTTTC TGCCAATGGC GATAAAGCCC AACTAA
|
Protein sequence | MHLKKWSLIG AALVGLLLAQ PITPSTASPQ ALDIATRGSI SCASPTTDRI DCFVVGDDKQ VWQHSWQSGF GWGTWVGLGF PYFTIDPGGG VSAVARDGTT IDIFVSGKDG NFFNLYQRTW NGTTWSDWTS LGGPGSWDIY ELSCTSASAT NISCFVRASD NHLWQKTWNG SNWSTWSDLG TFVNGGPFKG LASSNYNADS ILLFAIGSDG HAYRRFYESG TWSGWLDDGG PTGSSFDQVN CQDQAGSKIT CAFTDTMGDV WYRTWDAGWS AFTQLATPPS SAKGVAVAHY SSLGRVIVTN GFDGRLYRTF QPDPTSAWAS WLDDGRPQDL QIFLPMAIKP N