Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1838 |
Symbol | |
ID | 5733727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2137435 |
End bp | 2138595 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278982 |
Product | hypothetical protein |
Protein accession | YP_001544609 |
Protein GI | 159898362 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTTC GTGAGTCGCT GTGGACGATG GATTCGTTTG AAGAAGCCTT ATTTAGCATT AACCACGGAG GAGCCGTGAT CACTCATCTG CCGCGTTTGG TGGCGGGCTT GCCGCCCTCG GTTGCCGCTT TGTGGCGACA GCTGTTCCGC GTTGATGTGA GCCATGGCAC AATTCATCCA CCAGACCGCA TGCATGGCTG GATTGAACGC GCTTTTGGCA GTGTCGAAGC TGTGTTGCAC CAAGAAATTA TTCGGGTTAC CAATCAATGG ACGCTCGAAG GTGCAATTTT CAACCGCATT CGCGGGCTAC GACCACAGCA AGGCCGTTAT CAATCGGATA TTGATCAAAT TATCGCTGCC AGTGCTGGCC CACAGGATGA AGCGGGCTTT TGTACTCCCG AAAGTGGCAC ACCCGAAGAT CGTTTTGGCC GAGTGCGCGG ACGTTACTGT CTGAGCGCTT CCAATGTTGC CAAATTCGAT GGCTGGCATG GCGTGCTGAT CTTCGATCAA CATAATCCTT TAAATTTCAA CCTCGAACGG GTCGAGGATT ATATTCAGAC TGCTGGGCGT TGGTTCGATG CGGTACATAC CGCCGATCCG CAGGCGATCT ATCCATTTTT TATTTGGAAT TGCCTCTGGC GTTCGGGGGC ATCGGTCATT CATGGCCATG CTCAAATGTC GGTTTCCCAT GGCATGGCTT ACCCCAAGGT TGAGTTGTTG CGACGCACTG CTGAGCAATA TCACAACCAA CATAACGCTA ATTATTTCCA AGATCTTTGG CGCGTGCATC GGGCCTTGGG TTTGGGCTTG AATATTCATA GCGACAATAC CCATGGCTAT GTCTCGCTCA CGCCCATCAA GGAAAAAGAG GTCGTGCTGT TTGGCAACGA TGTGCAAGAG TTAGCCCAAT ATCTCTACCA TGTGCTCGAT TGTTTTGTCA CTGAATTGGG TGTGCAAAGC TTTAATGTGA GCATCACCAT GCCGCCGCTC AGCGCTACGC CTGAATCGTG GGCCAATGTG CCAACGATTG TACGGATCGT TGATCGCGGC GACCCCAACA GCCGTGGCTC GGATATTGGG GCGATGGAAT TGTATGCCGC CACGGTGGTT GCCAGTGATC CATTTAATAT TGCCAATGCA TTGGCAGGCC GATTTCCCTA A
|
Protein sequence | MSVRESLWTM DSFEEALFSI NHGGAVITHL PRLVAGLPPS VAALWRQLFR VDVSHGTIHP PDRMHGWIER AFGSVEAVLH QEIIRVTNQW TLEGAIFNRI RGLRPQQGRY QSDIDQIIAA SAGPQDEAGF CTPESGTPED RFGRVRGRYC LSASNVAKFD GWHGVLIFDQ HNPLNFNLER VEDYIQTAGR WFDAVHTADP QAIYPFFIWN CLWRSGASVI HGHAQMSVSH GMAYPKVELL RRTAEQYHNQ HNANYFQDLW RVHRALGLGL NIHSDNTHGY VSLTPIKEKE VVLFGNDVQE LAQYLYHVLD CFVTELGVQS FNVSITMPPL SATPESWANV PTIVRIVDRG DPNSRGSDIG AMELYAATVV ASDPFNIANA LAGRFP
|
| |