Gene Haur_5125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5125 
Symbol 
ID5737083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp171840 
End bp173249 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content59% 
IMG OID641282290 
Productpeptidase C14 caspase catalytic subunit p20 
Protein accessionYP_001547881 
Protein GI159901635 
COG category[R] General function prediction only 
COG ID[COG4249] Uncharacterized protein containing caspase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.499707 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAACCT TTGACCACGG CTATGCCCTT CTGATCGGCG TTGGAACTCA CCACCATCCC 
GCTTACTCGT TACCCGTCAC GGCCAATGAT GCGCACGCGA TCCAACGCAT CCTCAGTGAT
CCTCGCTGGT GTGCCTATCC CACAGCGCAT ATTCGACTCC TGTGCAACGA AGCGGCCACA
ACCGACGCGG TACTCGATGG CTTCGCGTGG CTTACCGCCT GTGCGCAGCG TGATCCGCAG
GCCACGATTG TTGTCTACTA TTCCGGCCAT GGAGGTCGCA ATGAGGAGAC TGATACCTAC
TGTTTGCTGC CCCATGATGC GACCACGGCA ACCAGCCGTC TCTCAACCGC TACCCTGAGC
GCTGCATTGG ATGCGATCCC GGCCAAACGC CTCCTCGTCT TGCTCGATTG TTGTCACGCG
GGAGGCATGA CCAAAGATGG CCCAGACCAG GACAGCCAAT GGACAACCAC GGCTCCCAGC
GCCAGCATGC TCGCCGCATT ACAGCAGGGT GCGGGTCGCG TTGTCCTGTC CTCATCGACG
GGCAAACAAC GGTCGTACAT TCAGCCTGAT GCAAGCCTCA GTCTCTTTAC CACCCATCTC
CTTGCCGCAT TCCAAGGGGC AGGCAATCAA CCAGGGAGTA CCGAGGTGCG CGTGACCCAT
CTGATTCAGT ATGTCAGTCG TCAGGTGCAG CACGCAGCAC AGGCGCTCAA TGCCCGGCAG
ACTCCGTTCT TCCAGGCGGC AAGTGAAGAC TTTGTGGTCT CCTTGATTCG TGGTGGTGCA
GGGCTGGGCA AAGGGGGATG GGAGGCACTC CAGCAGGACG CACACGCCGC AATTGCCGCC
GCATCCCCAC CATCGCCACA TCCGACAATA ACCGATAGCA ATGTGGCCAT TGGCAACGTC
GTCGGAGGCA ACCTCACGCA AACGAAGCAG TCAGGTGGCG TACACGCCGA GGGAGCCACC
ATTGGTCGGA TTGGGACACT GACGGGTGGC GATAGCTATG GCGGGGATCG GATCGAAGGC
TCCAAGACCG TGTATGATGG CCGAGCCACC GTCAACGACA GTACGATCAC AGGAACCGTG
ACCGGACTCA GTACAGGCAC CATCATCCAC GGTGGAAGCG ACAGAGCAGC GCCTGCCCAT
CCCTCGCTCA CCATGGAATG TCCCGAAATG ATGGTTTGTG GTCACGACCA TGCAGTTATC
ATCCATGTTC ATGGGACGAA CCCCACGCTG CGCTATCAGC TTGATCTGCG GGCGACAGGT
ATCCATGAAA CAACCTATTT TATGGGGAGC GAAGTCACGC TGTATGTTAC GCCCTCGATG
CCGGGAGAGC TTCACCTGCG GGCGATCCTG AAGGCGCAGG ACGGAACCCA GATGACCAGC
CAGAGCCAGC GTATCATGGT GCACCGATAG
 
Protein sequence
MATFDHGYAL LIGVGTHHHP AYSLPVTAND AHAIQRILSD PRWCAYPTAH IRLLCNEAAT 
TDAVLDGFAW LTACAQRDPQ ATIVVYYSGH GGRNEETDTY CLLPHDATTA TSRLSTATLS
AALDAIPAKR LLVLLDCCHA GGMTKDGPDQ DSQWTTTAPS ASMLAALQQG AGRVVLSSST
GKQRSYIQPD ASLSLFTTHL LAAFQGAGNQ PGSTEVRVTH LIQYVSRQVQ HAAQALNARQ
TPFFQAASED FVVSLIRGGA GLGKGGWEAL QQDAHAAIAA ASPPSPHPTI TDSNVAIGNV
VGGNLTQTKQ SGGVHAEGAT IGRIGTLTGG DSYGGDRIEG SKTVYDGRAT VNDSTITGTV
TGLSTGTIIH GGSDRAAPAH PSLTMECPEM MVCGHDHAVI IHVHGTNPTL RYQLDLRATG
IHETTYFMGS EVTLYVTPSM PGELHLRAIL KAQDGTQMTS QSQRIMVHR