Gene Information |
Locus tag | Haur_4743 |
Symbol | |
ID | 5736587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 6051861 |
End bp | 6053297 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281908 |
Product | hypothetical protein |
Protein accession | YP_001547502 |
Protein GI | 159901255 |
COG category | [S] Function unknown |
COG ID | [COG1543] Uncharacterized conserved protein |
TIGRFAM ID | |
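The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The page does not state either convention, but both are consistent with the listed values. The function name `cds_lengths` is hypothetical, introduced only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not translated (assumptions, not stated on the page).
    """
    gene_length = end_bp - start_bp + 1      # both endpoints are counted
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Start bp and End bp copied from the Gene Information table.
gene_len, protein_len = cds_lengths(6051861, 6053297)
print(gene_len)     # 1437, matching "Gene Length"
print(protein_len)  # 478, matching "Protein Length"
```

The same check works on the minus strand, since only the span between the two coordinates matters for the lengths.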
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAAGC CCACGACTGA CCAACCAACC GATCTCTGCA TCATTATGCA TGCCCATGCG CCGTTTGTGC GCCAACCTGG GCCATCCAGC CATGGCGAAG AAGAGTTACA TGGCCTGATG GCCGATTGTT ATATTCCGCT TTTACGCGGC CTCGCCGACT TGATCGAGCA AAATAAAGCG GTCAAAATGG GCTTGGCCAT CAGCCCAATT CTGTTGCATC AATTGAATGA TCCAGTTGTG CAAAAGCATA TGCTGCATCA TCTCGATGAG GCGATTGTTC AGGCCGAGCG GCGGCTTGAA GCAACCAAGG CCGATGAGCA CGCAGCCTAT CTAGCCCAAT TTTATTTACA ACTGCATCAA CGCACCTTAC ATGTCTTTGA AAAGCAGTGG AGCCGTAATG TTGTAGCCAT GACCCGTGAG TTGGTCGAGG CTGAGGCCTT GGAAGTGCTG GGCCATACGG CAACCTATCC CGTACCACAT TTATACGATC GCCCGACTTT GCGCACCCAA GTCGAAATTG GCACGATCGC GCTGATGCAT TTGCTCGGCC AAACTCCAAG CGTCTTGTGG AGCGCCGAAG ATCCAGTTTC GCCAGAGTGG GAGCAGATTC TCAAGCCTTT GGATATGCGG GCCTTGGTCG GGACTGCCAA CCCTGAGTTA CGCGGACAAG CCCTTTCGTG GCTTTCCGAA GCTCGTCAAA CCGCTGTGAT TACGCCCGAT CAGCGCTTGC TGACGCATAT TCGCTCGGTT GGCTTGGGCT ATCCTGGCGA TCCGCTATAT CGTGCGCCGA TGGAAATTCC CATGGATGAT GGCATGCTGA GTCGTGATGG CAGCGCCTAT GATCCATTTT ATGCCTATGC GCGGGCGCGG ATGCATGCCC GTCACTTCAC CGCTGTGCTT AGCGAAACCG CCAGCAACGG CCAGCCCTTA GTGATTGTGG CTGATATGGA ATTATTTGGT AGCGGCTGGT TTGAGGGTGG TTTATGGCTG CGCACGTTGC TCGAAGAATT GCCCAGTCAT TTTCGCTTGG TTACGCCTAG CACCCTGCTC AAACGCCATA AACCTAGCGT GGTTGTGCCG CCACAGCATT TCGCTGGAAT TATCGATGGC CGTTTGAGCG CGGCCCATCG CCAATTACAA GCGGCGGTAG CGCAATATCC CCAAGCTTAT GGCGACCGCG AACAGGTGCT CAACCAAGCT GCCCGTGAAT TTTTGCTGGC ACAATCAGGC GATTGGGATC GCTGGGTTGG TACGCCTGGC GCAGATTATG CCAATGCTCG GCGCAGCGAA CACCTGCAAA AATTCGATTG GCTGATGCAG TTATTGCCGC ATGAAACCTT GTTACCAGTT GCCGCCCGTG AATTTGCCGC CTTGCAAGAG CGCGATAATC CTTTTCCAAC CCTGAATTAT CGTTTGTTTG GGAATTTTGA ACGATGA
|
Protein sequence | MEKPTTDQPT DLCIIMHAHA PFVRQPGPSS HGEEELHGLM ADCYIPLLRG LADLIEQNKA VKMGLAISPI LLHQLNDPVV QKHMLHHLDE AIVQAERRLE ATKADEHAAY LAQFYLQLHQ RTLHVFEKQW SRNVVAMTRE LVEAEALEVL GHTATYPVPH LYDRPTLRTQ VEIGTIALMH LLGQTPSVLW SAEDPVSPEW EQILKPLDMR ALVGTANPEL RGQALSWLSE ARQTAVITPD QRLLTHIRSV GLGYPGDPLY RAPMEIPMDD GMLSRDGSAY DPFYAYARAR MHARHFTAVL SETASNGQPL VIVADMELFG SGWFEGGLWL RTLLEELPSH FRLVTPSTLL KRHKPSVVVP PQHFAGIIDG RLSAAHRQLQ AAVAQYPQAY GDREQVLNQA AREFLLAQSG DWDRWVGTPG ADYANARRSE HLQKFDWLMQ LLPHETLLPV AAREFAALQE RDNPFPTLNY RLFGNFER
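The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch of that cleanup plus two sanity checks: GC fraction, and start/stop codon presence. The helper names (`clean`, `gc_fraction`, `has_valid_ends`) are hypothetical. The stop codon set is the standard TAA/TAG/TGA, which also applies under translation table 11. Only the first three blocks of the gene sequence are used here as sample input, so the printed GC value describes that prefix and not the whole gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons used by translation table 11


def clean(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_valid_ends(seq: str) -> bool:
    """True if the sequence starts with ATG, ends with a stop codon,
    and has a length divisible by three."""
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First three blocks of the gene sequence, copied from the page.
prefix = clean("ATGGAAAAGC CCACGACTGA CCAACCAACC")
print(len(prefix))                     # 30
print(round(gc_fraction(prefix), 3))   # GC fraction of the prefix only
print(prefix.startswith("ATG"))        # True
```

To check the whole record, pass the complete "Gene sequence" field through `clean` and compare its length against the "Gene Length" field and its GC fraction against the "GC content" field.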