Gene Haur_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1869 
Symbol 
ID5733758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2209590 
End bp2211770 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content47% 
IMG OID641279013 
ProductABC transporter related 
Protein accessionYP_001544640 
Protein GI159898393 
COG category[V] Defense mechanisms 
COG ID[COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTTTA CCAAGCCAAA CACAAATCAG CTTCTAAAAC GCTACACTCA ACGGCGCAAA 
GTACCAGTTT TATTGCAAAT GAGCCAAATC GAGTGTGGTG CTGCATGTTT AGCAATGATC
TTAACCTATT ATGGCTATGA AATGAGTGTT GCTGAATGTC GTGAGCGCTG TGGCGTTGGC
CGTGATGGTA TTAGCGCAAA AACATTGGCT CAGGCAGCTC GCAGTTATCA ACTTGAGGTT
AAAGCGTTTT CATTTACCTA CGAATCCTTA TTAGCCTTGC GTCAACCAAT TATTATTCAC
TGGAATTTCA ATCATTTTGT GGTACTTGAA CGCACGACGG AAACCTATGC TGAAATCATT
GACCCGAACT TTGGCCGCCG ACGGATTGAT AAAGCCGAGT TTCTAACTGC ATTTACTGGG
GTTAGCTTGG TTATGCAGCC AAGTGCGAGC TTTCAACGCC GTAAAATTCG CCGCGAAACA
ATCTTACAAA ATTATATACA ACTCCTGACC CAGCATAAAA CGCTCTTTTT TCAATTGATT
TTTACCTCAA TTTTATTGCA ACTTGGCGGC TTAGTTGTGC CGCTATTCAC CAAAATAGTC
ATTGATACGG TTATTCCCCA AGCACTGATC CATGGAATGA CATTAATTGC ATTGGCGATT
ATGGTATTTA TCGCTATGCA AGGTGTGATT CAATTATTGC GCCAACAATT AACTATTTAT
CTGCAAACCA AGCTTGATTT GCAGATAATG CAACGCTTTT TTCGCCATCT CTTGGCTTTA
CCGTTTGTCT TTTTCGAGCA ACGCAGTAGT GGTGATTTAT TGATGCGCCT GAACAGTAAT
ACGATCATGC GCGAACTGAT TACCAGTCAA GTGTTGACCT TGATTCTCGA TGGTAGCTTG
GTGCTTGGCT ATTTCCTGCT CATTTGGTGG CAAAGTAACG TGTTGGCGGC GATTGTGCTT
GGCTTTGCCT TGCTCGAAAT TGGGTTGGTG CTGCTAGCCC AATCGCGCCT ACGCGAAATT
ATCAATCAAG ATCTTGATGC TCAAGCCAAA GCCCAAGGTT TTTTGGTTGA AGCATTGAAT
GGTATGACGA CCATCAAAGC CAGCGGCATC GAGCAACAAG TCTACGAGCA ATGGTCGCCG
CGCTATACCA ACCAATTAGT GTGGTCGCTG CGGCGCTCAC GTGCCAGTGC GGTGATCGAT
ACCGCGATCA ATTGTATTCA TATGAGTGCA GTGTTAGGAC TTTTGTGGTT TGGCACCCAA
TTGGTGCTCA ATCAGCAACT CAGCACCGGT TCACTGCTCG CACTTTTAGG TATTGCAGGC
GCATTCTTCG CGCCGCTAGC AATGCTGATT CGCACCGTTC AAAATATTCA ATTGGCGAAT
CTCTACTTCC AGCGAATTGC CGATGTCTTG CATAGTAACG TCGAACAACC AAACAAGCCA
ACCCCAAGTG CCTTAATGAG CGCTGGCCAG ATCGAACTCC GTGATGTGTC GTTTCGCTAT
AGCACCCATA GCCCAATTGT CTTGAAGAAT ATCCAACTAA CCATCAAGCC AGGCCAGAAA
GTTGCCTTGG TTGGCAAAAC TGGCTCGGGC AAAAGCACCC TCGCCAAACT GCTTTTGGGA
ATGTATCAGC CAAGCAGCGG GGCAATCTAC TACGATAAGC AGCCGACCGA AGCCTTTGAT
CTGGCGACAT TACGCCAGCA ATTTGGGGTT GTGCTCCAAG ATACATTTCT TTTCAGCGGC
TCAATTCGCC AAAATATTAC GCTCCAACGC CATGATCTTA AGCTTGCCCA AGTTATTGAG
GCTTGTCAAC AAGCAGCGAT TGCCAGTGAT ATTGAGGCCA TGCCGATGGG TTTGGAGACG
ATTTTGGCCG AAGGTGGGAG CAGCCTTTCG GGTGGGCAAC GCCAACGCTT AGCCTTAGCG
CGAGCGTTAG TTCATCAGCC AAGCGTGCTG TTGCTCGATG AAGCCACCAG CCACCTTGAT
GTTGCCACCG AAGCCGAAGT TGATCGCAAT CTTAATCACT TGGCTTGTAC GCGAATCGTG
ATTGCCCATC GCCTAAGCAC CATCGTCAAT GCCGACTTAA TTGTGGTGCT GCGTGATGGC
CAAATTATTG AGCAAGGCCG CCACGAAGAA TTGCTAGCGC AAGCGGGCTA TTATGCCCAA
CTTATTCAGC AACAAGCCTA A
 
Protein sequence
MIFTKPNTNQ LLKRYTQRRK VPVLLQMSQI ECGAACLAMI LTYYGYEMSV AECRERCGVG 
RDGISAKTLA QAARSYQLEV KAFSFTYESL LALRQPIIIH WNFNHFVVLE RTTETYAEII
DPNFGRRRID KAEFLTAFTG VSLVMQPSAS FQRRKIRRET ILQNYIQLLT QHKTLFFQLI
FTSILLQLGG LVVPLFTKIV IDTVIPQALI HGMTLIALAI MVFIAMQGVI QLLRQQLTIY
LQTKLDLQIM QRFFRHLLAL PFVFFEQRSS GDLLMRLNSN TIMRELITSQ VLTLILDGSL
VLGYFLLIWW QSNVLAAIVL GFALLEIGLV LLAQSRLREI INQDLDAQAK AQGFLVEALN
GMTTIKASGI EQQVYEQWSP RYTNQLVWSL RRSRASAVID TAINCIHMSA VLGLLWFGTQ
LVLNQQLSTG SLLALLGIAG AFFAPLAMLI RTVQNIQLAN LYFQRIADVL HSNVEQPNKP
TPSALMSAGQ IELRDVSFRY STHSPIVLKN IQLTIKPGQK VALVGKTGSG KSTLAKLLLG
MYQPSSGAIY YDKQPTEAFD LATLRQQFGV VLQDTFLFSG SIRQNITLQR HDLKLAQVIE
ACQQAAIASD IEAMPMGLET ILAEGGSSLS GGQRQRLALA RALVHQPSVL LLDEATSHLD
VATEAEVDRN LNHLACTRIV IAHRLSTIVN ADLIVVLRDG QIIEQGRHEE LLAQAGYYAQ
LIQQQA