Gene Haur_1300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1300 
Symbol 
ID5733193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1508547 
End bp1509701 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content47% 
IMG OID641278440 
Productpeptidase C2 calpain 
Protein accessionYP_001544076 
Protein GI159897829 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGCCC CGATTGTGCA AATTGATTAT GAGCTGATCA AGCAGGTTGC CCAGCGGTTT 
CAGCGCCAAA CCGAGCAGGT GCAAACAATT CGCCTGCAAA TTCAACAGGT TGCCGAGCCA
TTAATTGCTG GGGCATGGCA AGGTGCTGCG GCAACGGCCT TTGCCAACGA ATATCAAACC
CAACTACTGC CCACCCTACA ACGCTTAATG ATTGTTTTGC ATACTGCCCA GCAAGTTAGC
CTTGAATTGA GCGGTGTATT GCACGAAGCT GAACGTGAAG CCGCTAGCTT GTTTCGGGCC
GAAGTGGTGC TTAATCAAAC AACTGATCAG GCAGGCAAAT ACAAAGATGC CTATCTCGAA
ATTAGCGAAA TGCGCCCAGT TGAGGGTGAA TTATATTTAG CTGGCGGGGC TGATATGCGC
CAAGGCATTC ACCCCAGCGA TGCTGATCAA GGCCAGATTG GCAATTGTTT TGTGGTAGCT
TCGCTGGCGG CGGTAGCCCA AAATAACCCC GATGTGATTC GTAATGCAAT TGAAGATAAT
GGCGATGGCA CCTATACCGT TACATTTTAC CAGCGCGAAG CCGATACGCG CTTTAATCGT
TTAAATAATT GGTTTGATAA TGGCTTTGAT CCGGTGAAAA TCACCGTAAC TGCTGAATTT
CCAGTGCTTG CTGATGGCAC ACAGCCCTAT ATCCACGAAA ATCAAGAAGT GTTGGATGGC
AAACGCGAAT TATGGCCAGC AATTATGGAA AAAGCCTACG CCCAATTTCT GAGTCAAAGC
AATAATCCAA TTGATATGTA TAGTACGCTC AACAAAGGTG GTAACCCTGC CGATGTGCTA
GAGGCGATTA CTGGTCAACG TAGCGCGATT AACGAACCTC AAAGCTACAG CATTCATCAA
CTAGCCACGA TGCATAATAA TCAACAAGCG ATTATTTTTG GCACGCCTGA TCCAAGCGAT
CCGAGCGTTA ATCAACCAGC GTTTATCAAT AAACAACTGC AACCGAAGCA TGCCTACTAT
GTGAGCCATA TCGATCAACA GCGCAATTGG GTGACCTTGC GCAATCCATG GTCGTGGGAT
GAATCACCAG TCACGGTCGA TTATGCGGAT CTTGAGCAGG TGTTTAATGT TGTTATAACC
AATCCAATTG ATTAA
 
Protein sequence
MPAPIVQIDY ELIKQVAQRF QRQTEQVQTI RLQIQQVAEP LIAGAWQGAA ATAFANEYQT 
QLLPTLQRLM IVLHTAQQVS LELSGVLHEA EREAASLFRA EVVLNQTTDQ AGKYKDAYLE
ISEMRPVEGE LYLAGGADMR QGIHPSDADQ GQIGNCFVVA SLAAVAQNNP DVIRNAIEDN
GDGTYTVTFY QREADTRFNR LNNWFDNGFD PVKITVTAEF PVLADGTQPY IHENQEVLDG
KRELWPAIME KAYAQFLSQS NNPIDMYSTL NKGGNPADVL EAITGQRSAI NEPQSYSIHQ
LATMHNNQQA IIFGTPDPSD PSVNQPAFIN KQLQPKHAYY VSHIDQQRNW VTLRNPWSWD
ESPVTVDYAD LEQVFNVVIT NPID