Gene Haur_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3197 
Symbol 
ID5736899 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4043339 
End bp4045588 
Gene Length2250 bp 
Protein Length749 aa 
Translation table11 
GC content53% 
IMG OID641280343 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_001545962 
Protein GI159899715 
COG category[C] Energy production and conversion
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases
[COG3794] Plastocyanin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0061261 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCTGA TCAAACCATT CATCCTCCTA ATGGTGTTAG CAGGCTTTTG TCTGCCAAGC 
GCCTCCGCCA ACCCCCCCGA CCGTCTCGCA AGCAAAGCCG ATTCAGCCCT GCTGGCAAAG
CTGAATGCTG GCCAACAAGT GCCAACCCTC GTGCTATTGC AAGCCCAAGT CGATACCAAC
TTCGCTGATC GTTTGGCCAG TAAAGAAGCC AAGGGTGCGG CGGTGGTTGC CGCCCTGCGC
CAACAAGCCC AACGCGATCA AACTCCACTG CTTGCCGAAC TCAGCAACCG CGGGATTCAA
TCCGAGGGCT TTCTCTCGGT TAATGCCTTG TATGCGACGC TTGATTTAGC CAGCGCTCAA
TGGCTGGCCG AACAAGCCAG CGTCAAGCAA TTGATCGAAG ATTCGATTGT GGTCAAGGTC
GAAAAACCAG TTGCCGAAAC TAACCCAGCA CCCCAAGCGG TGAATACCAC GACCTGGGGG
GTTAACTATG TCAAAGCCCC CGAAGTGTGG GCTAAAGGCA TTACTGGCCA AGGCATCGTA
ATTGCTGGCG AAGATACTGG GGTGCGCTGG ACGCACGCCG CATTGAAGAG CAAATATCGC
GGCTGGGATG GCACAAATGC TAGTCACGAT TACAACTGGT ACGACGGCAT TCGCACTTCG
TTAGGTGGCA CGAATCCTTG TGGTTTAGCG CTGAACGTGC CTTGTGACGA TAACTCGCAT
GGCACTCACA CCGTTGGTAC GGTTGTTGGC GATAATGGCA CGGGCGAACA AATTGGGGTT
GCGCCTGGCG CAAAATGGAT CGCCTGCCGT AACATGGATG CTGGCAACGG TACGCCTGCA
ACCTACATTC GCTGTATCGA CTGGATGTTG GCTCCGTTCC CCACTGCTGG CACCAGTGCT
CAAGGCGACC CCAGCAAAGC GCCGCACGTT GTCAACAATT CATGGGGCTG CCTAGCCTCA
GAAGGTTGTA CCAATACCCC TTCTGATGGC ATTCAGACCT CAGTTCAAAA TGTGACCAAT
GCAGGGATTA TGTTTGTGGC CTCAGCAGGG AATGATGGGA GCGGTTGTGC CACCATCACC
ACGCCAGTCG CGATCTACCC TGAAAGCTTT GTGGTTGGCT CACATACCTC AACTGGGGCA
ATTTCTGGGT TTAGCAGCCG TGGTCCAGCT ACCAACAATG GTGCCAGCCG AATCGGCCCA
GACATTTCAG CCCCAGGCTC ATCTGTGCGT TCGGCAACCA ATGGTGGTGA TGATGTTTAT
GGTTCAAGCT CGGGCACGAG TATGGCTAGC CCGCACGTAG TTGGGGTTGT GGCCTTGTTG
TGGTCGGCTC GCCCAGAACT GCAAGGCCAA GTTGATCTCA CTCGTGCGAT CTTGCAAGAA
ACCGCTACTG CTGCGCCTTC AACCCAAACC TGTGGTGGTG TTGCTGGTAG CAGCATCCCC
AACAATACCT TTGGTCATGG CTATGTCAAT GCTTTGAATG CGATTGCCCC AACCTTGCAA
GGCAGCATCA CGGTCGATGG CACGGCGGCA ACCTCAGCCA CCATTCGCTT GGAAAATAGC
GTTGGCGTGG TTGAATTTGG CAAAACCACT GGTGTCTATA GCACCAGCTT GCCAGCAGGC
CTCTACAGTG CAACCGTTAC TGTGCCAAAC GAAACGCCAA TTACTCGCCA AGTAACGATT
GTCAAAGGCC AAGTTGCGAC TGAAAACTTT GAATTTGGCG ATGTGACTGG TACTGTTAGT
GGTCATGCAA CCCTCAATGG CACGGGCCGC GCTGGTGTCA GCATCACCGC CAACCCCGGC
AACTTCACCA CCGAAACCAA CGCTAACGGC GATTACAACT TAGCCTTAGC CCCGGGAACC
TATACGATTT CTAGCGAATT TATGGCCTTG GAAACCCAAG TGGCAACGGT AACGGTGGTT
CTCAATCAGA CCGTCATCCA AGATTTTGAT TTCGCGACCA CCCAAACCAT CAATATTCAG
AACTTCCGGT TTAGCCCTAG CCCAATTACG GTTACCTTGG GAACCAGCAT CTTGTGGCGT
AACCTTGATG CTTCGACCCA CACCACGACC CGTGGTCAAA TGCCGTTTAT TTGGGATTCA
GGCGATCTGA GCCAGAACCA AGATTATGCC GTAACCTTTG ATCAAGTTGG TACTTTCAGC
TATGTTTGTA GCTTGCATGG CAGTATGCAA GGTACGGTTG TGGTAACTCC ACCAATGCAA
AATACCTATC TGCCGTGGAC AACTAAATAA
 
Protein sequence
MRLIKPFILL MVLAGFCLPS ASANPPDRLA SKADSALLAK LNAGQQVPTL VLLQAQVDTN 
FADRLASKEA KGAAVVAALR QQAQRDQTPL LAELSNRGIQ SEGFLSVNAL YATLDLASAQ
WLAEQASVKQ LIEDSIVVKV EKPVAETNPA PQAVNTTTWG VNYVKAPEVW AKGITGQGIV
IAGEDTGVRW THAALKSKYR GWDGTNASHD YNWYDGIRTS LGGTNPCGLA LNVPCDDNSH
GTHTVGTVVG DNGTGEQIGV APGAKWIACR NMDAGNGTPA TYIRCIDWML APFPTAGTSA
QGDPSKAPHV VNNSWGCLAS EGCTNTPSDG IQTSVQNVTN AGIMFVASAG NDGSGCATIT
TPVAIYPESF VVGSHTSTGA ISGFSSRGPA TNNGASRIGP DISAPGSSVR SATNGGDDVY
GSSSGTSMAS PHVVGVVALL WSARPELQGQ VDLTRAILQE TATAAPSTQT CGGVAGSSIP
NNTFGHGYVN ALNAIAPTLQ GSITVDGTAA TSATIRLENS VGVVEFGKTT GVYSTSLPAG
LYSATVTVPN ETPITRQVTI VKGQVATENF EFGDVTGTVS GHATLNGTGR AGVSITANPG
NFTTETNANG DYNLALAPGT YTISSEFMAL ETQVATVTVV LNQTVIQDFD FATTQTINIQ
NFRFSPSPIT VTLGTSILWR NLDASTHTTT RGQMPFIWDS GDLSQNQDYA VTFDQVGTFS
YVCSLHGSMQ GTVVVTPPMQ NTYLPWTTK