Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0854 |
Symbol | |
ID | 5732755 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 966182 |
End bp | 967441 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277986 |
Product | hypothetical protein |
Protein accession | YP_001543630 |
Protein GI | 159897383 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0320102 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTTC GTGAATGTCC TAATTGTCGG GCAAGTGTAG ATGATGTAAA TCCGTTTTGT TCTGAGTGTG GAACCCGGCT CAACGCTGCG CCAACGGCTG CCGAAGGGCC AACTCAAGCC TTGCCGCGCT ACGATGGTCA GGGCGGCAGT TTTCCACCAC CAGAACAAAA TTATGGTCAA GCTGGTAGCT ATCCACCGCC ACCACAACAA AATTATGGTC AAGGCGGTAG CTATCCACCA CAGCAAAATT ATGGTCAAGC TGGCAGCTAC CCACCACCAC CGCAACAAAA TTACGGTCAA GGCGGTAGTT ATCCACCGCC ACCACAACAA AATCATGGTC AAGCTGGTAG CTACCCACCA CCACCGCAAC AAAATTATGG CAATCAGCAA CCGTACACGC CCGCCAACTA TCAACAACCA AATTATGGCG CGGCCTATCC CAATGCCAGC GTACCGACCA AACGCAGCAA TAACACCGGA ATTGTAATCG GGATTGTCGT GGGGATTCTG GTGCTGATCG GGGCAGGCGT AGCCTTTTTG ATGGATGGCG ATAATGACGA TCCAAATAAT AAAGTCGTCG CAGGCAATGC GACCAGCGTC GCCACCGCCA CTGCACGCCC AACCCGCACC CCAGCGGCAG AAGCAACCGA GGAGCCAATT CCACCAACTG AAGAGCCGAG CACTGGTGGC TTAGCCTTCT TGGAAGACGA TTTTTCGACC ACCGATGAAG CTTGGCCCGA TCAAGAAACT GCCAGCAAAT CGGGCAAATA TGCCTATGTC GAAGATGCCT ACCAAATTCA TGTGTATGTC AATGAACGCA TCATCTGGAC ATCGACCAAC GAAACATATA GCGATGTTGA TGCCCAAATT GATGTAAAAA TGGTCAGCGG CGACCCAACC AATGCAGCCG GCCTTGTGCT ACGTGAGCAA ATCGATGGCG CTGCTTCAGG CAGTTTGTAT GTCTACCAAA TCGATGGTCA AGGCAATATT GCCTTCCGTC GCTACGACAA ACAAACCGAA GAATGGACTA ACCTCGTCGA TTGGAAATTC AGCTCAGTTG CCAATGATGG GATTGGCGCA ACCAATCGGC TGCGCGTGGT GGCGGTTGGC TCGAACTTTA CCTTCTATTT GAATGGTGAA AAAGTTGCCA CCTACAACGA TGATACCTAT GCCAACGGCG CGGTGGGCTT CGGCGCAAGC ACCTTTGATA ATGGTGATAC CTTAACCGAA TTTGATAATT TACGCATGGC TTATCCCTAG
|
Protein sequence | MALRECPNCR ASVDDVNPFC SECGTRLNAA PTAAEGPTQA LPRYDGQGGS FPPPEQNYGQ AGSYPPPPQQ NYGQGGSYPP QQNYGQAGSY PPPPQQNYGQ GGSYPPPPQQ NHGQAGSYPP PPQQNYGNQQ PYTPANYQQP NYGAAYPNAS VPTKRSNNTG IVIGIVVGIL VLIGAGVAFL MDGDNDDPNN KVVAGNATSV ATATARPTRT PAAEATEEPI PPTEEPSTGG LAFLEDDFST TDEAWPDQET ASKSGKYAYV EDAYQIHVYV NERIIWTSTN ETYSDVDAQI DVKMVSGDPT NAAGLVLREQ IDGAASGSLY VYQIDGQGNI AFRRYDKQTE EWTNLVDWKF SSVANDGIGA TNRLRVVAVG SNFTFYLNGE KVATYNDDTY ANGAVGFGAS TFDNGDTLTE FDNLRMAYP
|
| |