Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4830 |
Symbol | |
ID | 5736675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6158223 |
End bp | 6159584 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281995 |
Product | hypothetical protein |
Protein accession | YP_001547588 |
Protein GI | 159901341 |
COG category | [S] Function unknown |
COG ID | [COG2308] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAGAGC AAGCGATTCG TGATTATCAC GCCCTCCTCA CGCCCGATCT CGCTCGTGCT TCTCAAGAAC GTCTCACCAA CCTACAACAG CAACAAAATC TCTTTTTTGG CACCCGTCCG TTGTGCAATG TGCTACGCCC ACATTTTCTC AGCGTCGAGC AATCGAGCTT GATCGAGCGC GTCTCGCAGT TGGTGGCCGA AGCATCGCGC ACGGTGGTTG AGTATGCTTT GCGCACGCCT GAGGTGCTCG ATCTGCTGGC CTTGACCGAG GGTGAACATC AATTAATTAG TTACGAGCCT GGCTACCGTG AATTAAGCGT TTCTTCACGG CTTGATTCAT TTTTAACCAG TGATAGTAGC TCGTTTCAAT TTGTTGAATA TAACGCCGAA AGCCCCGCCG CAATCGCCTA CGAAGATATT CTTTCGCAGG TGTTCGAGCA ATTGCCGATT ATGCAAGAGT TTCAGCGCCA TTATCGGGTT GAAAGTTTGC CTGCTCGGCA ACGCTTGTTA GAAGCTTTTT TGGCGGTTTA TCGCGAATGG GGTGGCACAG GCGAGCCAAA AATTGCCATT GTCGATTGGC ATGGCCTGCC GACGCTCTCG GAATTTCAAC TATTTCAGCA ATATTTTGCT GAGCATGGCT TGAAAACGGT GATTTGTGCG CCGGAAGATT TGCGCTATCA GGCTGGCACG CTGTATGCCA ATAACACACC AGTTAATTTT GTCTATAAAC GCTTGTTGAC AACCGAATTT TTGCAGCGTT TGGGCAATGA AGCCTTTGAT CATCCATTAA CTCAGGCCTA CCGTGATGGA GCAATTTGTT TGGCCAATAA TTTTCGCGCC AAATTGCTGC ACAAAAAAAT GATCTTTGGC TTGTTATCTG ATCCGGCGAT CACTAGCGCC GCTGGAATTA GCTCAGCCAC CCAGCAACAG CTGGCCCAGC ACATTCCTTG GACGCGGCGA GTGACGGCTG GCCGCACCGA TTATGCTGGC ACAGAAGTTG ACCTACTCGA TTTTATTCGG CAAAACCGTG ATCGACTGTT GCTCAAGCCC AACGACGATT ATGGCGGCCA CGGGATTACG ATTGGTTGGG AAACCGAGGC CGAAGCTTGG GATTTGGCGC TACAGCAGGC CTTAACTGAG CCATTTGTGG TGCAAGAGCG CGTAGTGATT GCCTACGAGG ATTATCCAGC TATGGTGGAT GGTCAATTGC AGATCGGCCA GCGCTTGGTC GATACCGATC CATTTTTATT TGGCAGCGAA GTTCAAGGCT GTCTGACGCG CTTATCGACG GTGACGTTGC TGAATGTGAC CGCCGGCGGC GGCTCGACCA CACCAACGTT TCAGCTCTCT AAACTGAGCT AA
|
Protein sequence | MLEQAIRDYH ALLTPDLARA SQERLTNLQQ QQNLFFGTRP LCNVLRPHFL SVEQSSLIER VSQLVAEASR TVVEYALRTP EVLDLLALTE GEHQLISYEP GYRELSVSSR LDSFLTSDSS SFQFVEYNAE SPAAIAYEDI LSQVFEQLPI MQEFQRHYRV ESLPARQRLL EAFLAVYREW GGTGEPKIAI VDWHGLPTLS EFQLFQQYFA EHGLKTVICA PEDLRYQAGT LYANNTPVNF VYKRLLTTEF LQRLGNEAFD HPLTQAYRDG AICLANNFRA KLLHKKMIFG LLSDPAITSA AGISSATQQQ LAQHIPWTRR VTAGRTDYAG TEVDLLDFIR QNRDRLLLKP NDDYGGHGIT IGWETEAEAW DLALQQALTE PFVVQERVVI AYEDYPAMVD GQLQIGQRLV DTDPFLFGSE VQGCLTRLST VTLLNVTAGG GSTTPTFQLS KLS
|
| |