Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0470 |
Symbol | |
ID | 5732369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 549987 |
End bp | 551285 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641277596 |
Product | hypothetical protein |
Protein accession | YP_001543249 |
Protein GI | 159897002 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000200568 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAAGTG ACATAGCCAG CGAGCTAGGA ATTGTCTTGG TATTGCTGGT TGCCAACGGG GTTTTTGCTG CATCGGAGCT AGCCATGGTT TCGGCGCGGC GTTCTCGCTT AGAACAACAA GCCGCCGATG GCGATCTTCG GGCAAAGAAA GCCCTGCAAC TCGCCGACCA ACCTGATCGT TTATTAGCCA CGGTTCAAGT TGGCATCACA TTAATTGGTA CGTTTGCAGC AGCTTTTGGG GGTGCTAACA TTAGCAAGCC CTTTGCCGAA TACCTAAAAA CGGTTCCTGC GCTCGCTCCC TATGCCGATT CAATTGCATT TACGGTGGTT GTATTGCTCA TCACCTATCT TTCGTTGATT ATTGGTGAGC TTGTGCCCAA ACGCTTAGCA TTGCTGCATG CCGATGCGAT TGCCCGTAAT TTAGCGCCGC TGCTGGCTTG GCTTTCGTGG CTAACCCGGC CAATTGTCTG GGTCTTGACC ACCTCATCGA CAGTGATTCT GACGTTAATT GGCCAAAACA AAAAACCCGA TTCTAGCGTA ACCGAAGATG ATATTTTGTA TATGACTCGC GAAGGTCGGG CTGGCGGCAC GGTCGCCTTG CACGAAGAGG CCTTGATCTC ACGGGTATTT GATTTTTCGG ATCGTACAGC CCGCATGTTG ATGACCCCAC GGCCTGATGT TGTAGCGGTC AGTGCTAACA CTCCCTTGGA CGAAATTACG CGGATTGCGG TTGAACATGG CTATTCACGC ATGCCAGTTT ACGAAGGTGA TTCGCTAGAT CGAGCAATTG GGACGATCTA TATTAAAGAT GTGTTGCCTT CAATGTTGGG CAACGATCAA CGCCAATTGC GTGAATTGGT GCGCCCGCCA ACGTATGTAT TGGAGCACGA ACCAGTTTCC AAGATGTTGA CCCTGTTTCG GCGCACTGGC TCACACATGG CCTTGGTTGT CGATGAATAT GGCCAAATTG CTGGAATTTT AACCCTCGAA GATGTGCTTG AAGAATTGGT TGGCGATATT CGCGATGAAT ATGACTCCAA CGAAGAACAA ACCATGGTCA AACGTGATGA TGGCTCATGG CTGATCGACG GCTCGGAATC ATACGAAGTG ATTGCAACCC GCTTGAGTAT TCCAATTAAT GATGATAACG ACTTTGTGAC GATTGCTGGC TATGTGTTGA ACGAACTGCA TCGGTTGCCC AACGTTGGCG ATCATGTAAC TTGGGATGAA TACGATGTCG AAGTGATCGA TATGGATGGT CGGCGGATTG ATAAGGTGTT GATCAAAAAA CGCGCCTAA
|
Protein sequence | MLSDIASELG IVLVLLVANG VFAASELAMV SARRSRLEQQ AADGDLRAKK ALQLADQPDR LLATVQVGIT LIGTFAAAFG GANISKPFAE YLKTVPALAP YADSIAFTVV VLLITYLSLI IGELVPKRLA LLHADAIARN LAPLLAWLSW LTRPIVWVLT TSSTVILTLI GQNKKPDSSV TEDDILYMTR EGRAGGTVAL HEEALISRVF DFSDRTARML MTPRPDVVAV SANTPLDEIT RIAVEHGYSR MPVYEGDSLD RAIGTIYIKD VLPSMLGNDQ RQLRELVRPP TYVLEHEPVS KMLTLFRRTG SHMALVVDEY GQIAGILTLE DVLEELVGDI RDEYDSNEEQ TMVKRDDGSW LIDGSESYEV IATRLSIPIN DDNDFVTIAG YVLNELHRLP NVGDHVTWDE YDVEVIDMDG RRIDKVLIKK RA
|
| |