Gene Haur_3111 details


Gene Information

Locus tag: Haur_3111
Symbol:
ID: 5734983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3924733
End bp: 3926151
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 49%
IMG OID: 641280255
Product: carboxyl-terminal protease
Protein accession: YP_001545877
Protein GI: 159899630
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
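
The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes 1-based inclusive coordinates and a CDS ending in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3924733, 3926151)
print(length)                     # 1419
print(protein_length_aa(length))  # 472
```

Both values agree with the Gene Length and Protein Length fields listed for Haur_3111.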


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000358486
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAT TCCAGCCCAC CTCGGGCGGT AGCTCAACAT CATCAGCAAA GCTATGGGTG 
GTGTTGAGTG GGATTGTTGG AGTCTTGTTG GTGGTCGCGA TTGCCCTTGG GGCTGGCTAT
TATTGGGGTA GCTCATCGAA AGAGCAGACG ATGACGGCGG CGAATCAAGC TTTAGCCACC
GAAACCGCCC AAATTATGCA AGCAACCCAA CAGGCGCTGC CCCCAGCCAA TGCCGATGAA
AACTTTCAAA CCTTTTGGGA AGTTTGGAAT CTGGTCAACA AAGAGTTTTA TCACACCGAG
CCAATCGACG AAAAACAAAT GATGTATGGC GCAATTCGCG GCATGCTCCA ATCGCTTGGC
GATGATTTTA CTGGGTTCCA AGAACCCGAA GCCGCCGAAC GCTCGCGCGA GGATATGCGC
GGCAATTTCG AGGGCATCGG AGCCTATGTC GAGTATAAAG ATGGCCAGAT CCTAATTGTT
TCGCCAATTG AGGGTTCGCC TGCTGAAAAA GCCAATGTGC GAGCTGGCGA TATTGTGGTC
GCGGTCGATG GCAAGCAAAT TAGTGAAGTC ATCGAGAATC TTGAACGCGA TCAAGCGCTT
GCAGAAGCCA TTAAGCTGAT TCGTGGCCCC AAAGGTTCGC AAGTCGTGAT TACGGTCTAT
CGTACCAGCG AAGAAAAGCA AATCGATATT ACGATTATAC GCGATACGAT TCCGTTGATC
AGCGTGCGCT CAAGCATGAT TGGCGATATT GGCTACATTC AATTGAGCGA ATTCAAGCAA
ACATCCTACG ATGAATTAGA CCAAGCAATT GCCAAACTCA AAACCAATAA CCCTAAGGCA
ATTATTTTTG ATTTGCGTAA CAATCCAGGC GGTTATGTCA ATCAAGCTCA AAATGTACTT
GGACGCTTTA CCAAAGATGG GGTAACCCAC TATCAAGAAA ATAGCGATGG TACGCAAAAG
GAATATCGAA CTTTGCAGCA AGGCGATGCC CAAGAATTAT TTGATCTCCC AGTTGTGGTC
TTGGTAAATG GTGGCTCAGC CAGCGCCTCG GAAATCGTCT CTGGTGCGAT GCAAGATACC
AAACGCGCAA CCCTGATTGG GGAAAAGACC TTTGGCAAGG GTTCGGTCCA AAGTGTGCAT
ACCCTGTCGG ATAAATCGGA AGCGCGGATT ACGATTGCCC ATTGGCTTAC TCCCAACAAA
CGGGCAATTC ATACGCTGGG GATTACCCCC GATTATGTTG TGCCGTTCTC GGATGATGCA
ACCCAATATC CAATTGAATG TATTTTGAAT CGCACACCTG CCGATGGGGC AACCAGTTGT
GCTGATTCAC AATTGTTCTG GGCGCTAAAG TTCTTGAACG AACAACAAAC CCCACCGCCA
CCGCCAACCC CAACGATTAC ACCAACCCCT GGCAAATAG
 
Protein sequence
MSEFQPTSGG SSTSSAKLWV VLSGIVGVLL VVAIALGAGY YWGSSSKEQT MTAANQALAT 
ETAQIMQATQ QALPPANADE NFQTFWEVWN LVNKEFYHTE PIDEKQMMYG AIRGMLQSLG
DDFTGFQEPE AAERSREDMR GNFEGIGAYV EYKDGQILIV SPIEGSPAEK ANVRAGDIVV
AVDGKQISEV IENLERDQAL AEAIKLIRGP KGSQVVITVY RTSEEKQIDI TIIRDTIPLI
SVRSSMIGDI GYIQLSEFKQ TSYDELDQAI AKLKTNNPKA IIFDLRNNPG GYVNQAQNVL
GRFTKDGVTH YQENSDGTQK EYRTLQQGDA QELFDLPVVV LVNGGSASAS EIVSGAMQDT
KRATLIGEKT FGKGSVQSVH TLSDKSEARI TIAHWLTPNK RAIHTLGITP DYVVPFSDDA
TQYPIECILN RTPADGATSC ADSQLFWALK FLNEQQTPPP PPTPTITPTP GK
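
The gene and protein sequences are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be translated and its GC fraction computed. It assumes the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGCGAAT TCCAGCCCAC CTCGGGCGGT"
print(translate(first_blocks))  # MSEFQPTSGG
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence block and the GC content field.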