Gene Haur_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1868 
Symbol 
ID5733757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2204861 
End bp2208103 
Gene Length3243 bp 
Protein Length1080 aa 
Translation table11 
GC content52% 
IMG OID641279012 
Productlanthionine synthetase C family protein 
Protein accessionYP_001544639 
Protein GI159898392 
COG category[V] Defense mechanisms 
COG ID[COG4403] Lantibiotic modifying enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAC ACAAGCAGCT TGGTTGGGGT GCGACGCTCC CCTTGGCAAT GCGCAGCATG 
CATGTGCAGC AATGCTGTGA TCAACCGTTG AATGCTGCCA ATCAGCGACG TTTTGACCGC
TGGCGGGCGC AAACGCCCTT CCAAGATAGC GATTGGTTTT CCCAACGCCT CGCCCAGAAT
CAGCTTACCC CTGAGCAATT ACAACAGTTT TTACAAACGC CTGCCCAACA ATTTGAAGCC
GACCAAGCAG CACCCGACTG GGTTGCGCTC TTCGAAATGG CCTATCGCAC CCCGTTATCG
GAGCGGCTGC CTTATTCGGC CAAATTGTTA CAACACCCCA TGCATGGCTT GCTCTATCTG
GTTGAGCCAT TATTGGCCTA CGCGGTGCAA CACCTACAGC AACAACTCAG CGCAGTCCAA
CCTCGCCAAG CCTTTTTACA ACCAGCGCAA TGGGTCACGC TCTGGTTTGA GGCGTTAGCC
CAACGCTGTC TGTGGATGAT TAGCCGAGTA ACTGTGCTTG AGTTACATAT TGCGCGAGTT
CAAGCCCAAT TAAATGGCAC AACCCCCGAA GAGCGCTATC AAGATTTTGT GCAACAATTG
CAAGATCGTC AACGGGCACA GCAGTTGTTA CAAGCCTATC CAGTCTTAAT TCAGCAATTG
TGCTTAACCA TTCAGCGCTG GCAAACCAAT AGCATGACAA TCTTTGAACG CTTGAACCAC
GATTGGCCAG CCTTACAGCA GCTTTTTCCG GTGCTCCAAC AAACAGATGG ATTAATTGGC
ATCCAAACGG GCGCTGGCGA TGTGCATGCT GGCGGCCAAA GTGTGGTCAT TTTGAGCTTC
AGCAATGCCA AGCTCGTCTA TAAACCACGC TCATTGGCGA TTGACCAAGC ATTTCAAGAA
CTCCTACACT GGATCAACCA ACATTCGTCG ATGCTCCCTT TGCGTCTACT CAAAATTCTC
GATCGCCATG ACTATGGCTG GTCGGAATGG GTTGATCATG CGATGCTCAG CGACTCGGCA
GCAATTGAAC GTTTTTATCA ACGCCAAGGC ATTTATTTGG CCTTGTTGTA TGTGCTCAAT
GCCTCGGATT TTCACCACGA AAATATTATT GCCGCTGGCG AAGATCCAAT GTTTATCGAT
TTGGAATCGT TGTGTGGCCC CCAAGTGCAT AGCGATAATC AGCTTGAAAA CGAATTTATG
GCCAACCAAG TACTCACCAA CTCGGTCTTG AGGGTTAGTT TGTTGCCGGA GCGTTTTCAA
GCCCGGGCGG GCAAAACCGG CATCGATATT AGCGGCTTGG GCACACGCGA TGGTCAAACG
AGTATGGATA CCATGCCGCT GTGGGTCGAA GCTGGCACTG ATAGCATACG TATGACCAAG
CAAACCGTCA GCCTTTCTGG CAGCCAACAT AGCCCGATCG CCACCGTTAG CGCCGAGCAA
ATTGGCAGCT ACCTCAATTG TGTGGTCAGC GGCTTTGAGG CCATGTATGA TTTTTTGCTA
GCCCATCGCA GCGAATTACT GGCTGATAGC AGCCCCCTAG CCAACCTCTA CCGTTGCAAG
ATCCGCGTGA TTGCACGGCA CACAGCCTAT TACACCAAAA TTTATCAAGA GAGCTTTCAC
CCCGATGTAT TGCGCGATGC GCTTGATCGC GATTGGTTGT TTGATCGTTT ATGGTTTGAG
GTAAAATACA ATCAGCGTTT GGTCGAATTG ATTCCGTATG AACATCGCGA TTTATGGCAG
GGCGATATTC CGTTGTTTAC CACCGTCGTT GATTCATGCG ACCTGTGGAG CAGCGATGGT
CAACGCATTG CCGACTACTT GCCACGTTCA GGCAAAACAA TGGTGCTTGA GCGCCTACAC
CAACTCAGCT CCAACGATTT GGCCAACCAA GTACGCCTGA TTAGTTTCTC GTTTGCCACC
ATGAGCGCCT CACTCCACAA CAGCTATCGT TCAGATGAGC GTTATCTGTT GCCAACCAGC
ATCCAACCGA GCCATGCCGA TTGGCTGTTG GCCGCCTGCG GTGTTGGCGA TGAATTGCTG
GCAACCGCCT TGCAAAATGA CCATTCAATT ACGTGGATCG GCCTGACCCA ACAGCTAGAA
TTGCAAATGG TCGGGTTGGA TTTCTACGAC GGGTTGCCCG GCATCATCTA TTTCTTGGCC
TATCTCGGGG CAATTAGCGG GGTTGAACGC TACACCCAAG CGGCTGAGCG AGCACTGCAA
ACCCTCGAAC TGCTACTCGA ACAACATCAA GCCACATTAA CGGATGTTGG GGCATTTGTA
GGTTGGGGCG GCCTGAGTTA TCTCTACTGG CATTTATCGA GCTTGTGGCA ACAACCGGCC
TTATTGGAGA AAGCTAAAAC ATGGCTAGCC CAAATACCCT CACTGCTGGC CAACGATAGC
ACCTTTGATC TCATGGCAGG GGCGGCAGGT AGCCTCTTGG TAGGCTTGCG GCTTTATGCC
CAGCAACCTA GTGCTGAGTT ATATACAAGT TTAGTGGCCT GTGGTGAGCA CTTGCTTGCC
AATAGCCAAG CCGAGGAGCT TGGGCGCTCG TGGCAAACCA TCACCGATGC TGAGCAACCA
GCGTTGGGTG GTTTAGCGCA CGGTACTGCG GGCATCGCCT GGGCCTTGAT CGAGCTTGCC
CAATTAACCA ATGATCAGCG CTACCGTGAA TCTGGCTTGC AAGCCTTGGC CTACGACAAT
AGCCTATTTG TGGCCGATCA GCAGAATTGG CGCGATATTC GCACCGCCAA AACCAAGGGC
AACCAGCAAT CCGATGATCT GGTGATTTGT ATGGCGGCAT GGTGTCATGG AGCGAGCGGG
ATTGGGATTT CGCGCTTGGC AATGCTGCGC TGCCTCGATG ATCCAACGAT CAATCACGAC
CTACAACAGG CCTTGGCCAC CACCCTCACC CAAGGTTTTG GCATGAATCA CTCATTATGC
CATGGCGATT TTGGCAACTT GGCCTTGATT CAAGCAGCAG CCAAGCACTA CGCCGACCAG
CAGTTAGCCG ACCAAGCCCA AACGATTGCC AGCGAACTAT TTGCCAGCAT TCAACGAGAT
GGCTACCGCT GTGGGGTGCA ATATGGAGCA CAACCACCAG GCCTGATGAC CGGCATTGCC
GGCATTGGCT ATGGCTTGCT CCAACAAGCC GCCCCAAATG TCGTGCCATT AGTCACATTT
CTCGAAAGCC CTGCCATCTT CCCAACCAAC GAACCATTGC TGGAGGTGAT CAGTCGTGGT
TAA
 
Protein sequence
MTTHKQLGWG ATLPLAMRSM HVQQCCDQPL NAANQRRFDR WRAQTPFQDS DWFSQRLAQN 
QLTPEQLQQF LQTPAQQFEA DQAAPDWVAL FEMAYRTPLS ERLPYSAKLL QHPMHGLLYL
VEPLLAYAVQ HLQQQLSAVQ PRQAFLQPAQ WVTLWFEALA QRCLWMISRV TVLELHIARV
QAQLNGTTPE ERYQDFVQQL QDRQRAQQLL QAYPVLIQQL CLTIQRWQTN SMTIFERLNH
DWPALQQLFP VLQQTDGLIG IQTGAGDVHA GGQSVVILSF SNAKLVYKPR SLAIDQAFQE
LLHWINQHSS MLPLRLLKIL DRHDYGWSEW VDHAMLSDSA AIERFYQRQG IYLALLYVLN
ASDFHHENII AAGEDPMFID LESLCGPQVH SDNQLENEFM ANQVLTNSVL RVSLLPERFQ
ARAGKTGIDI SGLGTRDGQT SMDTMPLWVE AGTDSIRMTK QTVSLSGSQH SPIATVSAEQ
IGSYLNCVVS GFEAMYDFLL AHRSELLADS SPLANLYRCK IRVIARHTAY YTKIYQESFH
PDVLRDALDR DWLFDRLWFE VKYNQRLVEL IPYEHRDLWQ GDIPLFTTVV DSCDLWSSDG
QRIADYLPRS GKTMVLERLH QLSSNDLANQ VRLISFSFAT MSASLHNSYR SDERYLLPTS
IQPSHADWLL AACGVGDELL ATALQNDHSI TWIGLTQQLE LQMVGLDFYD GLPGIIYFLA
YLGAISGVER YTQAAERALQ TLELLLEQHQ ATLTDVGAFV GWGGLSYLYW HLSSLWQQPA
LLEKAKTWLA QIPSLLANDS TFDLMAGAAG SLLVGLRLYA QQPSAELYTS LVACGEHLLA
NSQAEELGRS WQTITDAEQP ALGGLAHGTA GIAWALIELA QLTNDQRYRE SGLQALAYDN
SLFVADQQNW RDIRTAKTKG NQQSDDLVIC MAAWCHGASG IGISRLAMLR CLDDPTINHD
LQQALATTLT QGFGMNHSLC HGDFGNLALI QAAAKHYADQ QLADQAQTIA SELFASIQRD
GYRCGVQYGA QPPGLMTGIA GIGYGLLQQA APNVVPLVTF LESPAIFPTN EPLLEVISRG