Gene Haur_3812 details


Gene Information

Locus tag: Haur_3812
Symbol:
ID: 5735676
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4784665
End bp: 4785999
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 53%
IMG OID: 641280964
Product: peptidase S41
Protein accession: YP_001546576
Protein GI: 159900329
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
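
The gene length and protein length listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4784665, 4785999)
print(length, protein_length_aa(length))
```

With the values from this record, the result agrees with the listed 1335 bp and 444 aa.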


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0205907
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTTATC GTTGGTTGCT TTTGCTAGGA ATTGGATTGC TCGCTAGCTG TACTGCACAA 
TTGCCTTGGG CGGCCACACC AACTGTGCCA CCAACCACGG CTCCCCTCGC CTTGGCAACT
CCAACCTTTT TGCCCAGCCC AACCGCGCAG CCAAGCCTTG AGCCTACCCC AACCGTTGAG
CTAGCGCAAG CTACGCCAAC CGCGACAACC CCAATGAGTC CCGCCGAACG GCTAGCTTTA
TTCAATGATG TTTGGCAAAC GGTCAATGAA CATTATCTGT ATCCTGATTT TAATGGCGTG
GATTGGGCCG CCGTGCGTGC TGAAATCGAG CCGCAAGTGC AGGCTGCGCC CGATGATGAA
ACGCTCTACA CAATTCTCGA AGGCATGGTC GCCAAACTCG ATGATCAACA TTCGCGCTTT
GCCCGACCAG TCGAAGCAGT TTATGAAGAT GCCGTTGCCA GCGGAACCGA TAGCTATGTT
GGCATCGGTG TTTTGACAAT TCACGAAGAA AATGCCGCTT TTATTACCTT GGTTTTCCCT
GATAGCCCAG CCCAAGCGGC GGGCTTGATG CGCGGCGATC GCATTACCGC CGTTGAAGGC
CAGCCGTTTA CCAATGCCGA CCAAATTCGT GGCCCCGAAG GCAGCCAAGT GCGGCTGACC
ATTCAAACAC CGCAAGCTGA CCCACGCGAG TTGCTGATAA CGCGGCGGGC AGTAGTGGGC
AAAATTACGC CATCAGGCCG CCGCTTGCCC AATGCTCCAA CCGTTGGCTA TTTGCTGATT
CCCAGCTTGT GGGCCGACGA TATGCATACC CAAGTTGTCA GCGAACTCAG CAAATTGGTC
GCCGATCCTC AGCCGCTTGA TGGTTTGATT TTGGATCTGC GATCCAACGG CGGTGGCTGG
CGTAGCGTGC TTGAAGGCAT TTTGGGTCAA TTTGTCAGCG GCGAGGTTGG CAATTTCTAT
AGTCAGGAAA AATTGTATCC ATTAACTGTT AAACCTGGCC TGTTGTACGA GCAGCTGAAG
CAAGTGCCGC TAGCGGTGCT CATCGACAAA GATAGTGCTT CGTATGCTGA GGTTTTGGCT
GGAACGCTGC AATTTAATGG GGCTTTGGTG CTGGGCCAAG CCAGCCAAGG CAATACCGAA
ACGATTTTTC AATATAATTT TGAGGATGGC TCACGCTTGT GGGTGGCCCA AGAGGGCTTT
AAATTGCCTG ATGGCAGTAA TTTTGAAACC AAAGGTGTGC AGCCAAATAT TGTGGTCGAA
GACGATTGGA CCCAATACAC GATTCCGAAT GATCCGGCGG TATTGCAGGC GATTGTTTCA
TTTAGCGAGC GCTAG
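
A GC content figure like the one in the gene information can be recomputed from a sequence printed in this layout. The following is a minimal sketch, assuming the sequence is pasted as text in blocks of ten bases separated by spaces and line breaks. The helper names are hypothetical, and only the first line of the gene sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence to
# compute the value for the whole gene.
first_line = "ATGCGTTATC GTTGGTTGCT TTTGCTAGGA ATTGGATTGC TCGCTAGCTG TACTGCACAA"
seq = clean_sequence(first_line)
print(f"{len(seq)} bp, GC = {100 * gc_fraction(seq):.1f}%")
```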
 
Protein sequence
MRYRWLLLLG IGLLASCTAQ LPWAATPTVP PTTAPLALAT PTFLPSPTAQ PSLEPTPTVE 
LAQATPTATT PMSPAERLAL FNDVWQTVNE HYLYPDFNGV DWAAVRAEIE PQVQAAPDDE
TLYTILEGMV AKLDDQHSRF ARPVEAVYED AVASGTDSYV GIGVLTIHEE NAAFITLVFP
DSPAQAAGLM RGDRITAVEG QPFTNADQIR GPEGSQVRLT IQTPQADPRE LLITRRAVVG
KITPSGRRLP NAPTVGYLLI PSLWADDMHT QVVSELSKLV ADPQPLDGLI LDLRSNGGGW
RSVLEGILGQ FVSGEVGNFY SQEKLYPLTV KPGLLYEQLK QVPLAVLIDK DSASYAEVLA
GTLQFNGALV LGQASQGNTE TIFQYNFEDG SRLWVAQEGF KLPDGSNFET KGVQPNIVVE
DDWTQYTIPN DPAVLQAIVS FSER