Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0205 |
Symbol | |
ID | 5732100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 239628 |
End bp | 240950 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277329 |
Product | hypothetical protein |
Protein accession | YP_001542985 |
Protein GI | 159896738 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.044781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTCA ACGTTTTGCT CAAGCCATGG CGTGGCGATA AACGCTTCAC GGCCATTGCC AAGTATGCCG CCGCCTTGAA AAAAAGCGGC TCCAAGGATT TGCAAATTGT GGCCCAAGTC GAGGCCGAAT TTGGCAAGCC CACCGAATTA GCTCGGATCA ACGAGCAACC ACAACCCTAC AAGGCTTATG GCCAAATTGG GGTTGATATT GAAGCCGCCG CTTTGGAGCA ATTGCTGTTG GCCTTGCGCC TGCCAATTGC CGCCCAAGGA GCGTTGATGC CCGACGCTCA CCCGGGCTTT GCCTTGCCGA TTGGCGGGGT TTTTGCCGCT CACAACGCCG TCTCGCCCAT GATGGTTGGG GTCGATATCG GCTGCCGTAT GCATCTCACG ATTTTTGCTG AAGCACCCAT GGAAATCCAA CGCCAGCGTG AACAATTATT CCGTGATTTG GCTGACGTGA CGGTGTTTGG CTCAGGCGCA AGTCGCAAAC GCGGGCCAGA TCATCCAATT TTAGGCATCA AACACTGGAA TATCACCGCT CAAACCCGCA GCTTGCGCGA AAAAGCCATC GCGCAACTTG GCACAAGCGG CGGCGGCAAC CACTTTGCCA ATATCGTCGT TGGCGAACGC ATTGGCGAAA ACGATTTGCC GCGCCAATTT GTTGGCTTGC TGACCCATAG CGGCTCGCGC GGAGTTGGTT ATGCGATTGC CAAGCACTAT AGCAATATCG CCGTGCAAGA AACTGCACGC CAAGCCCATG TGCCCAAAAT GTACGAATGG CTCAATTTGG ATAGTGAGGC AGGCCAAGAA TATTGGGCCG CAATGGAATT AGCCGGAGCC TTTGCCCAAG CCAATCATGA AGTGATCCAC CGCTTATTTG CCCAACGCAC CAAACTCCAA CCAATCACCA CCATCCAAAA TCACCATAAT TTTGCTTGGC GCGAAGGCGA TTTGATTGTG CATCGCAAGG GTGCAACACC TGCGGGGGTT GGCGTGCGGG GAGTCATTCC AGGCAGTATG GCATCGGCCT CGTATGTGGT TGAAGGTTTG GGCAACCCCG AAGCCTTGCA TAGTGCCTCG CATGGAGCTG GCCGCTTGTT AAGCCGCTCC AAAGCGCGGG CCACCATCAG CCCAAGTGAA GCCAAAAAAG TCATCAAAGC TCACGGCGTG CATGTCGAAG GCTGGAGCAT CGATGAATCG CCATTGGCCT ACAAAGACAT CGAACGAGTG ATGGAATTGC AAATTGAGGC CGATTTGATC AAACCCCTCG CCCGTATGAA ACCTATCGCC GTAATTATGG CAGGCGAAGC GGGCGAGAAT TAA
|
Protein sequence | MKLNVLLKPW RGDKRFTAIA KYAAALKKSG SKDLQIVAQV EAEFGKPTEL ARINEQPQPY KAYGQIGVDI EAAALEQLLL ALRLPIAAQG ALMPDAHPGF ALPIGGVFAA HNAVSPMMVG VDIGCRMHLT IFAEAPMEIQ RQREQLFRDL ADVTVFGSGA SRKRGPDHPI LGIKHWNITA QTRSLREKAI AQLGTSGGGN HFANIVVGER IGENDLPRQF VGLLTHSGSR GVGYAIAKHY SNIAVQETAR QAHVPKMYEW LNLDSEAGQE YWAAMELAGA FAQANHEVIH RLFAQRTKLQ PITTIQNHHN FAWREGDLIV HRKGATPAGV GVRGVIPGSM ASASYVVEGL GNPEALHSAS HGAGRLLSRS KARATISPSE AKKVIKAHGV HVEGWSIDES PLAYKDIERV MELQIEADLI KPLARMKPIA VIMAGEAGEN
|
| |