Gene Haur_1815 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1815 
Symbol 
ID5733673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2110732 
End bp2112978 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content51% 
IMG OID641278958 
Productpeptidase C11 clostripain 
Protein accessionYP_001544586 
Protein GI159898339 
COG category 
COG ID 
TIGRFAM ID[TIGR02806] clostripain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00011349 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCAGA TTGCGCCGCG ACGTTGGCGT TTGACTCGTT TTAGCCTCAT TTTACTCTTG 
CTCTTAGCCG CTTGTGCCGA TCTCTCCGAA CCAACTCCGG TGGCAAAACG CACTCCGATT
GCTGGCCAAG CTACGCCCGC CGCCCCAACT CGCACCCCTG CTGGCCGCGA TGATCAGAGT
TGGCTGATTA TGCTCTACTC CGATGCCGAC GATGAGATTC TTGAAGAAGA TATGCTCAAC
GACATCAACG AGGCTGAACT GGTTGGCTCA ACCGATCGGG TGCGGGTAGT AGCTCAGGTC
GATCGCTATG ATGGCGGCTT CGATGGCGAT GGTGATTGGA CGAGCACCAA ACGCTTTTAC
ATCGAACAAG ATGACGACCT TGAGCAAATG AACTCCAAGG AACTCGCCGA CCTTGGCGAA
GTCAATATGG CCGATGCCGA TACCTTGACT GATTTTGTCA CATGGGCCGC CAAAACCTAT
CCCTCGGACA AATATGTGTT AATTATGTCT GATCATGGGG CTGGCTGGCC GGGCGGCTGG
AGCGACCCTG ACCCTTCAAC CACAGGTCGC CACGATATTC CGCTGGCCGA GAGCTTTGGC
GATATGCTTT TTCTCATGGA AATGAGCGAG GCACTCGAAC ATATCATCGC TGAAACCAAT
ATTGGCGAAT TTGAGTTAAT TGGCTTTGAT GCCTGCTTGA TGAGCCATGT TGAGGTCTAT
AGTGCGATCG CCCCCTATGC TCGCTATGCT GTGGCCTCGC AAGAAGTCGA GCCATCGCTG
GGCTGGGCTT ATGCTGCAAT TTTGGGGCGA TTAACCGATA GCCCTGAAAT CGATGGCGCT
GAGCTTTCGC GAGCAATTGT CGATAGCTAT ATCGAGCAGG ATCAACAAAT TCTCGATGAT
GATGCCCGCG CCAAATATGT TTCGCGCACC TACGATTTTG AGGGCAATGT TTCTGCTGAA
GAAGTTTTAG AGCAAGAGCG CAAGGCCATC ACCCTGACGG CAATCGATTT GGGCAAATTG
CCTGCGGTGA TCGATGCACT TGATCGTTTG GTGATTACTT TGGCCGAGGC CGAACGCAAA
GACATTGCAG CGGCACGACG CTATACCCAA GCGTTTGAGA GCGTGTTTGA TAGCGATCAA
CCTAAGCCCT ACATTGATCT TGGGCACTTT GCCCAATTGC TCAAGCAAAA AGTTAATGTG
CCCAGCGTCA ACAAAGCCGC TGATGAGCTG ATTGCAGCGA TTGATCGCAG TTTGATCGAA
GAGAAACATG GCGATGAAAA GGCTGGGGCA ACGGGCATTT CGATCCACTT TCCCAATTCC
AAACTGTATA CCAGTGCCGA TGCTGGCTAC AAATCCTATA ACATGGTTGC CGTAAACTTC
GTCAACGATT CGCTGTGGGA TGAATTTTTG GCCTTCCAAT ACGCCAAAAA GCCATTGCCA
ACCACCATTG AGCAACCAAC CGCAACCGTC GAACGTCAGC CAACCCCAAC GCCTGAGCCA
ATTGATGTGA CCGAGGTTGA AGCACCTGGC TCCGAGCCAA TTACGGTTGC TGCGATTGAA
CTTTCAAGCA CAACCGCCAG TCTCGATCAG CCTGTTGTGC TCACCAGCAG CATCACTGGC
GATAATATTG CCTTCGTTTA TATCTTCATC GGCTACTATG ATCAAGAGTC CGATTCAATT
CAAGTTTTGG ATATGGACTA CCTTGACGCT GAACAAACCC GCGAAATCGG CGGAGTCTTT
TATCCCGATT GGGGCGAGGC CTCGACGATC GATATTGAGT TTGAGTGGGA TACGCAAGTT
TTTGCCATGC ACAACGAAAC CACCGCAAGT CTAGCCTTGT TTAGCCCCGA AGATTATGGA
GCATCGCCCG AAGATGCCAC CTACACCGTC GAAGGCTTGT ACACCACTGC CAAAGGCAAA
AAGACTCGAC GCGCGTTGTT GTTGTTCAGC AATGGCGAGT TGGTACAAGT GCTCGGGTAT
ACTGGCAAGG AAGATACTGG TTCGTTGCGC GAAATCAACC CTAAGCGTGG CGATAAATTT
GTGGTGCTCG ATACTTGGCT TGAAGAATCA CAACAAACTG GCGAGAGCGA ATTTGTCAAT
TACGAAGGCG AAACCTTTGT GTTTGGCGAT GACAACTTTA CTTGGGAGCT TGAACCAGCT
CCTGCTGGGA ATTACCTCGT TGGCTTCTTC GCTGAAGATT TTGATGGTAA TGTCTACTCA
GCCTATGAAA CCTTGATTAT CGAGTAA
 
Protein sequence
MQQIAPRRWR LTRFSLILLL LLAACADLSE PTPVAKRTPI AGQATPAAPT RTPAGRDDQS 
WLIMLYSDAD DEILEEDMLN DINEAELVGS TDRVRVVAQV DRYDGGFDGD GDWTSTKRFY
IEQDDDLEQM NSKELADLGE VNMADADTLT DFVTWAAKTY PSDKYVLIMS DHGAGWPGGW
SDPDPSTTGR HDIPLAESFG DMLFLMEMSE ALEHIIAETN IGEFELIGFD ACLMSHVEVY
SAIAPYARYA VASQEVEPSL GWAYAAILGR LTDSPEIDGA ELSRAIVDSY IEQDQQILDD
DARAKYVSRT YDFEGNVSAE EVLEQERKAI TLTAIDLGKL PAVIDALDRL VITLAEAERK
DIAAARRYTQ AFESVFDSDQ PKPYIDLGHF AQLLKQKVNV PSVNKAADEL IAAIDRSLIE
EKHGDEKAGA TGISIHFPNS KLYTSADAGY KSYNMVAVNF VNDSLWDEFL AFQYAKKPLP
TTIEQPTATV ERQPTPTPEP IDVTEVEAPG SEPITVAAIE LSSTTASLDQ PVVLTSSITG
DNIAFVYIFI GYYDQESDSI QVLDMDYLDA EQTREIGGVF YPDWGEASTI DIEFEWDTQV
FAMHNETTAS LALFSPEDYG ASPEDATYTV EGLYTTAKGK KTRRALLLFS NGELVQVLGY
TGKEDTGSLR EINPKRGDKF VVLDTWLEES QQTGESEFVN YEGETFVFGD DNFTWELEPA
PAGNYLVGFF AEDFDGNVYS AYETLIIE