Gene Haur_2849 details


Gene Information

Locus tag: Haur_2849
Symbol:
ID: 5736886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3613979
End bp: 3617461
Gene Length: 3483 bp
Protein Length: 1160 aa
Translation table: 11
GC content: 52%
IMG OID: 641279992
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_001545615
Protein GI: 159899368
COG category:
COG ID:
TIGRFAM ID:
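
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 3613979
END_BP = 3617461


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length_bp(START_BP, END_BP)
print(gene_len)                     # 3483
print(protein_length_aa(gene_len))  # 1160
```

Both results agree with the Gene Length (3483 bp) and Protein Length (1160 aa) fields of this record.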


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00446023
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATCGTT TCCTACCACG TTGGGCTGTG TTATTGAGCT TACTCACCTT GGTGATTAGC 
TTATTTGCGC AGCCATTAGC CACGACAGCT CAAAAAGCTC CACTAACAAC TGATTATGCC
CCTGCGACGA GCGACAGTGT TGCCCCAGTC GCAACGCCTT CGCATCGCTT GATTATCGAG
TTGCAATCGC CAGCATTGGC TGCATGGAGC AAAAACACCA ATAAAGCTCG CAACGCCAAC
CAACGTTTGG ATCTGAAGGC GGCTGATGCC CAAACCTACC TGGCGCAACT CGAAGCTGAA
CAAAATCGCT TCGTAACTGA TATGCGCCAA GCCTTGCCTG GAGCCAGCGT CGAACGCTTT
GTCAATGAAT TTGGCGACCT CGACGAACTG CGCTATAGCG TGGTGTTTAA CGGGATGACC
GTCAACGTTG GCAATACCAA ACGTGATGAC GCTCGCAAAA TTCTGGAAGC CTTGCCCAAT
GTCAAGGGTG TCTATCTCGA CTTACCACAG CAAGCCGATT TGCATGTAAG CAACACCCTA
ATTGGTTCGC CAGCCTTGTG GAGCAGCTCG GCAATTGGTG GCCGCAACAA CGCTGGAGCA
GGCATCAAAA TTGCCTCGAT GGACGGTGGC GTGCATAAAG ATGCCCCTAT GTTCAGTGGC
ACGGGCTACA GCTACCCTGC GGGCTATCCA GCCAATGGTT TGGGTTTGAC CCAAAACAAC
AACGGTAAAA TTATCGCCTC ACGCACCTAC TTCCGCACTT GGGATGGCCC CGCTGATGGC
GACCAAAACC CATGGCCAGG CACGAATGGC ACACCGCACG GGGTGCACAC CGCTGGGATT
GCCGCTGGCG ATGTCGTAAC CGCAACCTTT GGTGGCCTGA ATTTGCCATC AACGAGTGGG
GTTGCGCCTA AAGCATGGGT TATGAGCTAT CGCGTGTTCT ATGCCAGCGT CAATGGCATT
GGCTCGTTCT ACAATGCTGA AGGGATTGCT GCACTCGAAG ATATCGTGGC TGATGAGGCT
GATGTCGTTA ATAACTCATG GGGCGGTGGC CCTGGCAGCA TCGGCGGCGA ATTCGATGCG
CTTGATACAG CCTTGATCAA CACCTCAAAT GCTGGAATTT TTGTTTCAAT GTCGGCAGGC
AATGCTGGCC CCAACAAAGG CACCAGCGAT CATCCTTCCG ATGAATATAT TGTGGTGGCG
GCCAGCACGA CCCAAGCAAC CTTTGCTGCT GGGCAGTTGA ATGTGACCCA ACCAACCCCA
ATTTCACCAA CCTTGCAAAC AATTCCGATT GCGGCTGCGA GCTTTGGCGG GCCAATTAAT
GCCCTTGTTA GCAACAACTA CCTGCCTGCT AGCGTGATTT CGCCAACCAA TGCGCTTGGC
TGTAGCCCAT TCGCGCCAAC GACCTTTACC GGCAAGATCG CCTTGATTCA ACGTGGTACG
TGTGAATTCG GGGTTAAAGC GCTGAATGCT CAAAACGGCG GCGCAAGCTT TGTGATTATT
TACAACAATG CTGCCAATGG CAACACCTTG ATCAACATGG GTGCTGGTGC AGTTGGTGCG
CAAGTCACTA TTCCGGCAAT TATGATTGGC TTTAACCAAG GGACTGGTTT GGTCAATTGG
TACGCTCAAC ATGGCGCTGC GTCAGTTGCC GAAATCAACC CATCGACCTA TCTTGCCCCA
AGCACTGCCG ATGTGATTGC TGGCTTTAGT AGCCGTGGCC CTGGCGTTGG CGATGTGTTG
AAACCAGATG TTACCGCACC TGGGGTTAAT ATTCTTTCCC ATGGCTATAC ACCTGGGGCC
AGCGGCGAAG CCCGCCACCT TGGCTATGGC ACTGCCTCGG GCACGTCGAT GGCTGCGCCA
CACGTTGCTG GTGCTGCTGC CTTGTTGCGC CAAGCTCATC CCAGCTGGAC CAATGACCAA
ATCAAATCGG CTTTGATGAG CACCTCAAAG TATATTGGGG TGACGGTTGC TGATGGTTCG
CCAGCTCAAC CCTTGGATAT GGGCGCTGGT CGGATCGATT TGAGCAAAGC CAGTGATCCA
GGCGTTTTCT TAAGCCCACC AAGCTTGAGC TTTGGTCAAG TATTGACCGG TACAACCAAG
TCGATTCTGG TGACGGTGAC CAACGCTACC AATGTTGCTG AAACCTACAA CATTAGCACC
CAATACACGG GCGGCGGTTT TAACAACATT ACCCCAATGG CTGGGGTAAC ACTCGCTTCA
AACACCATTG CCGTACCAGC CAATGGTAGT GCCCAATTTA CGGTTACCTT CAATAGTATG
GCTGGGCGTG GCTATGGCGA TAATCAAGGC TTTATTGTGC TTGATGGCCC AAATCACGAT
GCCCACATGC CTGCATGGGC ACGTGTAACC AAGCCAATGG CCAATGTTGA TGTGTTGGTG
ATTGATAACG ATGGTAGTGC TTCGCTTGGC GGTGCCTATA TCGATGTTAC CCGCTACTAC
ACCGAGACCC TTGAAGCAAT TGGCTTGACC TATGACGTTT TGGATGCTGA TGATTTGGCT
GGCAGCGTAA CGACCTTCTT GCCACCAGCT GAACAACTCT ATCCATACAA GGCGATTCTC
TACTTCACTG GCAATTACAA CCGCCGCAAT GGTGAGTTTA CGGTCGCCAC ACCATTAACC
GCCTTGGATA TGGATCGCTT GACCGAGTAT GCCAACAATG GCGGGACGAT CATCGCCATG
GGTCAGAATT TGGCGCAAGT GCTCAACTCA ACCAGCTCCA CCACCGAATC ATTCTTCTAT
AGTACGGTGT TGAGTGGTGA ATACGTTCGG GCCAATATCA ACTCATCGAA TGCGATGACG
GTTACATCGC TGATCACTGC CACAACCCAA GCACCACAAG TCTTCCAAAC CATGGAGATT
GATATTGATG CTAGCGAGAC TGCTGGCGGC GGCGCTCGTA ACCAAACCAG CGTTGATGCG
CTTGAGTCGC TCTATGGTAC CGACTTCGAT CCCAATCGCG ATCCATTAAT TCGCTCATTG
TTCTCAATCG AAGGCGAAGA AGATGTAACG GCTGGGCGGC TGCATCGTGC CCAACCAAGC
CTGGAAACCC CAGGTATCAG CTATCTCGGA CGCACCATCT ACACCACCTT TGGTTTAGAA
GGGGTCAATA ACTTGGGTGA TACCACCAGC CGTGAAGCGT TGCTCGAAAC CTTCTTCAAG
GTCTTGTGGG ATGATGCAAC CAGCATGATC ATGCCAACCT CAGTCGGCTC AAAAGTATGG
CTCGACGTTG ATTTGACCTC GGAATATGAT GCTGTGCCAG TTGAATATCG CTGGGACTTC
GGCGATGGTT CAGCCTATAT GACGGTACCA GCAGGTACTC CGGTGGCTCA TAGCTATGCA
ACACCAGGCC AAGTCTACAC TGTTCGGGTG GAAGTAACCG ACAGCTATGG CAATAAGACC
TTGGCAACCC GCCAAGTTAC CGCACCATGG GGTGTCTACT TACCAGTGGT TACCAAAAAC
TAG
 
Protein sequence
MNRFLPRWAV LLSLLTLVIS LFAQPLATTA QKAPLTTDYA PATSDSVAPV ATPSHRLIIE 
LQSPALAAWS KNTNKARNAN QRLDLKAADA QTYLAQLEAE QNRFVTDMRQ ALPGASVERF
VNEFGDLDEL RYSVVFNGMT VNVGNTKRDD ARKILEALPN VKGVYLDLPQ QADLHVSNTL
IGSPALWSSS AIGGRNNAGA GIKIASMDGG VHKDAPMFSG TGYSYPAGYP ANGLGLTQNN
NGKIIASRTY FRTWDGPADG DQNPWPGTNG TPHGVHTAGI AAGDVVTATF GGLNLPSTSG
VAPKAWVMSY RVFYASVNGI GSFYNAEGIA ALEDIVADEA DVVNNSWGGG PGSIGGEFDA
LDTALINTSN AGIFVSMSAG NAGPNKGTSD HPSDEYIVVA ASTTQATFAA GQLNVTQPTP
ISPTLQTIPI AAASFGGPIN ALVSNNYLPA SVISPTNALG CSPFAPTTFT GKIALIQRGT
CEFGVKALNA QNGGASFVII YNNAANGNTL INMGAGAVGA QVTIPAIMIG FNQGTGLVNW
YAQHGAASVA EINPSTYLAP STADVIAGFS SRGPGVGDVL KPDVTAPGVN ILSHGYTPGA
SGEARHLGYG TASGTSMAAP HVAGAAALLR QAHPSWTNDQ IKSALMSTSK YIGVTVADGS
PAQPLDMGAG RIDLSKASDP GVFLSPPSLS FGQVLTGTTK SILVTVTNAT NVAETYNIST
QYTGGGFNNI TPMAGVTLAS NTIAVPANGS AQFTVTFNSM AGRGYGDNQG FIVLDGPNHD
AHMPAWARVT KPMANVDVLV IDNDGSASLG GAYIDVTRYY TETLEAIGLT YDVLDADDLA
GSVTTFLPPA EQLYPYKAIL YFTGNYNRRN GEFTVATPLT ALDMDRLTEY ANNGGTIIAM
GQNLAQVLNS TSSTTESFFY STVLSGEYVR ANINSSNAMT VTSLITATTQ APQVFQTMEI
DIDASETAGG GARNQTSVDA LESLYGTDFD PNRDPLIRSL FSIEGEEDVT AGRLHRAQPS
LETPGISYLG RTIYTTFGLE GVNNLGDTTS REALLETFFK VLWDDATSMI MPTSVGSKVW
LDVDLTSEYD AVPVEYRWDF GDGSAYMTVP AGTPVAHSYA TPGQVYTVRV EVTDSYGNKT
LATRQVTAPW GVYLPVVTKN
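
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine at the first position. The sketch below illustrates this on the first line of the gene sequence only. It assumes the sequences are blocked in groups of 10 characters separated by spaces, as shown above; the helper names `unblock` and `translate` are hypothetical, and this is not necessarily how the database produced its translation.

```python
from itertools import product

# Amino acid line of NCBI translation table 11, in T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; read as M at the first position.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def unblock(text: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    residues = []
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 nt, 20 codons).
first_line = "GTGAATCGTT TCCTACCACG TTGGGCTGTG TTATTGAGCT TACTCACCTT GGTGATTAGC"
print(translate(unblock(first_line)))  # MNRFLPRWAVLLSLLTLVIS
```

The output matches the first 20 residues of the protein sequence listed above (MNRFLPRWAV LLSLLTLVIS).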