Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3530 |
Symbol | |
ID | 5736910 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4444383 |
End bp | 4445480 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641280677 |
Product | hypothetical protein |
Protein accession | YP_001546294 |
Protein GI | 159900047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000817362 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCAA CCATTGTGCA GATCGACTAT GCCGTAGTCG ATCCGATCAT TCAACGTTTC CAAAAACTGC ATGATCAGAG CCAAACGATT CAAGCCAGCC TCTGCCAAAC CATGGCCGCG TTACAAGCTG GCCAATGGCA AGGCAACGCC GCCAACGCCT GTTTTCAAGA ATTTGACCAT GTAGTTAAGC CAGCTTTTCA GCGCTTGCTC CACGTCTTGC AAGGCAGTGT TGAAACCACC AAGGCCATTC GGAAGATCAT GGCCGATGCC GAGGCTGAAG CCGCCGCGCT GTTTCGCGGC GATGTTGGCG CGATGGCCGT AGGTGGTAAT GGTGCGATGT TGGTTCAAGA GGCTGATGCG AACACCCCAA CCCCAACCCC GCCTGAACCA CCACTGAAAC TGCCATTCGT GCAAAAGCTG ATCGAATTGG CGCAGCAATT GTTTGCTGAG TGGCAAAAGA ATTTTGGCGA TAAGGTCAAT CCTCATCCTG ATGGCTCAGT AAGTGAGGCA GAAGCAACCA TTATTTTTAA CGATATGGCC AATGAGCCAG ATATTGCATT CAAGTATGCT AATGATGGCT GTTATGCACG GGCACATCTT ATGACCTATC GGATTCATGA ACGGTATGGT ATACCGCTAG AATCGTTAGA GAAAGCGTAT ATCCAAGCAT CAGGAACAGC CCCTGATACC CATCTTACTG TACCAACCGA ATATCGTTAT TCTGATCAAA AGTATGATGA TGTAAGCTCA TATGATGGGA TTGTGGACTG GGGATGGCAT GTTGCACCAA CAGTCAAGGT TAGAAATAAT GATGGTTCAA TTACACCGAT GGTTATCGAT CCATCTCTTT TCAGTCAGCC TGTTTCATTA GAAACCTGGC ATTCTAAAAT GAACGATAAT GATGCGATAT TAAATCTTGT ACCTTATAAT TGGTATACTC CAAGAAAGGG TTTTGAACCT ATTTCATCTT TGGGGAAACC CCATCCAAAC GAGGGTTTAG TAAATTATAC CCATACAAAA GAGGAATTAG ACATTTCTGC TGAAGCAACA ATGATTAATT ATATGAAAAG ATGTGAAGAT TCAGGTTATT GCAAATAG
|
Protein sequence | MPATIVQIDY AVVDPIIQRF QKLHDQSQTI QASLCQTMAA LQAGQWQGNA ANACFQEFDH VVKPAFQRLL HVLQGSVETT KAIRKIMADA EAEAAALFRG DVGAMAVGGN GAMLVQEADA NTPTPTPPEP PLKLPFVQKL IELAQQLFAE WQKNFGDKVN PHPDGSVSEA EATIIFNDMA NEPDIAFKYA NDGCYARAHL MTYRIHERYG IPLESLEKAY IQASGTAPDT HLTVPTEYRY SDQKYDDVSS YDGIVDWGWH VAPTVKVRNN DGSITPMVID PSLFSQPVSL ETWHSKMNDN DAILNLVPYN WYTPRKGFEP ISSLGKPHPN EGLVNYTHTK EELDISAEAT MINYMKRCED SGYCK
|
| |