Gene Haur_4660 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4660 
Symbol 
ID5736507 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5956671 
End bp5958155 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content50% 
IMG OID641281824 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_001547419 
Protein GI159901172 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.262908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATG CAGGTAGCGC TGAAAAACGC AGCGATTCGG AAAGCCCCAA GCCACCACAA 
GAGATTGTGA GCGTGACCCA TGGGCGCGTG AAAATCAAGG GCAACGATGT GCCTTACACC
GCCACCGCCG GAACAATCGT GCTCTACGAA GACGATCCGG AGTTTAAGCA AGCGCCCAAG
GCCAAAGCGA CGGTATTTTA TGTAGCCTAT ACCCGCAGCG ATGTTGATGA TCAAACCACC
CGCCCAATCA CCTTTTCGTT CAACGGCGGG CCAGGTTCGG CCTCAGTTTG GATGCACCTT
GGCTTGTTGG GGCCAAAACG GGTTTTGATG GCCGATGAAA CTGGCAATTT GCCAGCACCG
CCATTTCGCT TGGTCGAAAA CGAATATTCC TTGCTCGATC AAAGCGATTT AGTTTTTATC
GATCCAATTA GCACGGGCTA TAGTCGTGCG GCAACTGGCG AAAATCCCAA CCAATTCCAT
CAATTTACCA AAGATATTGA ATCGATCAGC GATTTTATTC GGCTCTACAC CTCTCGGGCC
AAGCGTTGGC TCTCGCCCAA ATATATTATT GGCGAAAGCT ATGGCACAAC TCGCGGCTCT
GCCATCACCA ACTATCTGCA AAATCGCTAT GGCATGTATC TGAATGGGAT TATGCTGATC
TCCTCGATTC TCGATTTTCA AACCGTCGAG ATGGACCCAG GCAACGATAT TGCCTATGTG
GTGATTTTGC CAACCTACGC CGCGACTGCG TGGTATCACA ACAAGCTTGA TGCCAAATTA
CAATTGAGTT TGAGTGATAC CTTGGCCGAA GTTGAGGCCT TTGCTGCTGG CGAATATGCT
ACGGCATTGT TGCAAGGCGA TAGTTTGGCC GAGGGCAAAC GCCGCTCGGT TGTGCGTAAA
TTGGCTCGCT ATACCGGTTT GAGCGAACGT TTTATCGATC ACAGTAATCT ACGGATCGAC
TTGATGCGGT TTACTAAGGA ATTGCTGCGC GATCAGCAAC GCACGGTTGG CCGCTTGGAT
AGCCGCTTCG TGGGCATCGA CCGCGATCCA ACCCGTGAAG CCTTTGAGTA CGACCCAAGT
TATGCGGTAA TTCATGGACC ATACAGCGCC ACCTTCAACG ATTATGTGCG GCGCGAACTC
AAATTCGAGA GCGACGAACC CTATGAAATT CTAACTTCAA AAGTTCGGCC TTGGAAATAC
GATAAGCATG AAAATCAATA TGTGAGCGTC ACCGAAGCGT TGCGCTCGGC CATCTCGCAA
AATCCCTATC TCAAGGTGTT TGTGGCCAGC GGCTTCTTCG ATTTCGCCAC ACCCTACTAT
GCCACGTTGC ACACCTTCAA CCACCTTGGG CTTGACCAAA CCCTACGCAA TAACATCGTA
ATCAAGCATT ACGAGGCTGG GCATATGATG TATACCCATT TGCCGTCGCT GGCTGAGCTA
AAAAGCGACC TCGAAGCCTT TATCAGCCAA ACCAAAAACG TCTAA
 
Protein sequence
MADAGSAEKR SDSESPKPPQ EIVSVTHGRV KIKGNDVPYT ATAGTIVLYE DDPEFKQAPK 
AKATVFYVAY TRSDVDDQTT RPITFSFNGG PGSASVWMHL GLLGPKRVLM ADETGNLPAP
PFRLVENEYS LLDQSDLVFI DPISTGYSRA ATGENPNQFH QFTKDIESIS DFIRLYTSRA
KRWLSPKYII GESYGTTRGS AITNYLQNRY GMYLNGIMLI SSILDFQTVE MDPGNDIAYV
VILPTYAATA WYHNKLDAKL QLSLSDTLAE VEAFAAGEYA TALLQGDSLA EGKRRSVVRK
LARYTGLSER FIDHSNLRID LMRFTKELLR DQQRTVGRLD SRFVGIDRDP TREAFEYDPS
YAVIHGPYSA TFNDYVRREL KFESDEPYEI LTSKVRPWKY DKHENQYVSV TEALRSAISQ
NPYLKVFVAS GFFDFATPYY ATLHTFNHLG LDQTLRNNIV IKHYEAGHMM YTHLPSLAEL
KSDLEAFISQ TKNV