Gene Information |
Locus tag | Haur_1300 |
Symbol | |
ID | 5733193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1508547 |
End bp | 1509701 |
Gene Length | 1155 bp |
Protein Length | 384 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278440 |
Product | peptidase C2 calpain |
Protein accession | YP_001544076 |
Protein GI | 159897829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
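The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; both assumptions match the values in this record but are not stated by it. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1508547, 1509701)
print(length, protein_length_aa(length))  # prints "1155 384"
```

The result agrees with the Gene Length (1155 bp) and Protein Length (384 aa) fields.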
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCCC CGATTGTGCA AATTGATTAT GAGCTGATCA AGCAGGTTGC CCAGCGGTTT CAGCGCCAAA CCGAGCAGGT GCAAACAATT CGCCTGCAAA TTCAACAGGT TGCCGAGCCA TTAATTGCTG GGGCATGGCA AGGTGCTGCG GCAACGGCCT TTGCCAACGA ATATCAAACC CAACTACTGC CCACCCTACA ACGCTTAATG ATTGTTTTGC ATACTGCCCA GCAAGTTAGC CTTGAATTGA GCGGTGTATT GCACGAAGCT GAACGTGAAG CCGCTAGCTT GTTTCGGGCC GAAGTGGTGC TTAATCAAAC AACTGATCAG GCAGGCAAAT ACAAAGATGC CTATCTCGAA ATTAGCGAAA TGCGCCCAGT TGAGGGTGAA TTATATTTAG CTGGCGGGGC TGATATGCGC CAAGGCATTC ACCCCAGCGA TGCTGATCAA GGCCAGATTG GCAATTGTTT TGTGGTAGCT TCGCTGGCGG CGGTAGCCCA AAATAACCCC GATGTGATTC GTAATGCAAT TGAAGATAAT GGCGATGGCA CCTATACCGT TACATTTTAC CAGCGCGAAG CCGATACGCG CTTTAATCGT TTAAATAATT GGTTTGATAA TGGCTTTGAT CCGGTGAAAA TCACCGTAAC TGCTGAATTT CCAGTGCTTG CTGATGGCAC ACAGCCCTAT ATCCACGAAA ATCAAGAAGT GTTGGATGGC AAACGCGAAT TATGGCCAGC AATTATGGAA AAAGCCTACG CCCAATTTCT GAGTCAAAGC AATAATCCAA TTGATATGTA TAGTACGCTC AACAAAGGTG GTAACCCTGC CGATGTGCTA GAGGCGATTA CTGGTCAACG TAGCGCGATT AACGAACCTC AAAGCTACAG CATTCATCAA CTAGCCACGA TGCATAATAA TCAACAAGCG ATTATTTTTG GCACGCCTGA TCCAAGCGAT CCGAGCGTTA ATCAACCAGC GTTTATCAAT AAACAACTGC AACCGAAGCA TGCCTACTAT GTGAGCCATA TCGATCAACA GCGCAATTGG GTGACCTTGC GCAATCCATG GTCGTGGGAT GAATCACCAG TCACGGTCGA TTATGCGGAT CTTGAGCAGG TGTTTAATGT TGTTATAACC AATCCAATTG ATTAA
|
Protein sequence | MPAPIVQIDY ELIKQVAQRF QRQTEQVQTI RLQIQQVAEP LIAGAWQGAA ATAFANEYQT QLLPTLQRLM IVLHTAQQVS LELSGVLHEA EREAASLFRA EVVLNQTTDQ AGKYKDAYLE ISEMRPVEGE LYLAGGADMR QGIHPSDADQ GQIGNCFVVA SLAAVAQNNP DVIRNAIEDN GDGTYTVTFY QREADTRFNR LNNWFDNGFD PVKITVTAEF PVLADGTQPY IHENQEVLDG KRELWPAIME KAYAQFLSQS NNPIDMYSTL NKGGNPADVL EAITGQRSAI NEPQSYSIHQ LATMHNNQQA IIFGTPDPSD PSVNQPAFIN KQLQPKHAYY VSHIDQQRNW VTLRNPWSWD ESPVTVDYAD LEQVFNVVIT NPID
|
| |