Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1828 |
Symbol | |
ID | 5733716 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2122807 |
End bp | 2125023 |
Gene Length | 2217 bp |
Protein Length | 738 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278971 |
Product | ABC transporter related |
Protein accession | YP_001544599 |
Protein GI | 159898352 |
COG category | [V] Defense mechanisms |
COG ID | [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0155639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAGTA AGACGTTAAT TTACCGCCGC TTACCATGGA TTCGTGCTTC AGAAGGCCGC GATTGCGGCC CCTCGGTTTT TGCATCAGTA GCGCATTTCT ACAAGCATCG AATTACCTTG GAGCAAGCAC GGTCTTTGGT TGGAGCCGAT CGCAATGGTA CAAGTTTGGC AGGTTTGCGT GATGGAGGCC GCGCAATTGG GCTTGAATCA CGGCCTGCTC AAGCGATTTA TAGCGCATTG CAACATATCC AATTGCCAGC GATTGCCCAC CTCAAGGGTG GCGAAGGCCA TTATGTGGTT ATCCATAAAT GGTCGCCAAC CTCAGTGGTT GTGCTAGACC CTAGCCGTGG TCTGCGCACG CTCAGCAAAG CTGAATTTGA GGCCATGTGG AGTGGTTATT TAGTTGAATT TAAGCCTACC GCAGCGCTCA AACCTCGCGA GATCGATGTT AAGCCCTTCA AAACCCTGTT GCAGCTCGCC CGCCAACATA AACTTATGTT AGGAATTGTG CTCTTCTTCG CCTTGCTAGC AACCTCGTTA GGCTGGGTAA CCTCGTTTTT TATGCAAATA TTGATCGATT CGATTATCCC AAATCGTGAT CAAGCATTAC TTTTTGCCTT AGGTCTAGGG TTAATTTTGG TCAGTGTTTT CCAATCGACA CTCCAATTTG GGCGTTTATG GCTTAGTGCC AAGGTTGGGC AGCATGTCCA CCAAGCCTAT TCAGCTCAGT ATATTGATCG TCTGTTACGC CTACCAATGA AAGTATTTGA TGTTCGTTGT ATCCCTGGAC TGGTGTTGCG GGTTACCCAA GCCGATGGGG TGCAATTAGC GCTTTCTGAA GGATTGATTA CGATTTTGGC CGATGTAGCA ATGTTTGTAA CTGCATTAGG CATTATCGCA TTCTATAACC CAATTGCGGC CTTGATCGCA GCAGCTGCCT TACCACTTGT TTGGTTTGTC TTATTCGCGT TGAATGATCG GGTGTATAAC GCTCAATTAG CCGCAATCAT TCGGATGGAA GAATTCACCT CGCAGATGGT CGATGTATTT GATTGTGTAC GAACAATCAA GGTTTTTGGC GCAGAAGAAC AATATAAAGC TTTACTCAAC GAAAAATTGG CTAATTTCAC AAAATCTCGT ATGGATAGCC GAATTAATAT CGCCCTGCCC AGCGCATGGA GCGTTTTAGC AACCTCATTA ATTACTGCTT CAATTTTATG GTACGGCAGT AGCCAAGTAT TTGCTGGGCG CATGACTCCA GGTGAGTTAG TTGTTTTGTT TGGAATGGTC GCATTTTATC TCCAACCAAT CCAGCGTTTA CCTGCCACAA TTCTCAATCT GCGCACAGCG TTATTGGGCA TTGAACGGAT GGATGAAATT ACCACCTTAC CAGACGAAGC TTCACGAACC AGCGAACCAA TCGCATTAGC TGAAGTCAAG GGCGAAATTA AGTTCAATGA TGTGCATTTT GCCTATATGC GCAACAAAAT GGTGCTAAAA AAGCTCAATT TTGAAATCAA GCCTGGCGAA ACTGTAGCAA TTGTTGGTGA AACTGGCTCA GGTAAAACCT CGTTGGCTAA TTTGATCGCA GGTTTTTATC TCCCAACCCA CGGCGATGTA TTAATTGATG GCATTAGCAC GCGCAATATC GACCCCGATG AATTACGACG CTCAATTAGT GCCGTCTTTC AAAATACGCG GCTGCTGCAA CAATCAATTC GCGATAACAT CACCCTGATG CGTGATACCG ATTTAGAATT AATTCGCAAT GCCGCCAAAA TTGCTCAGGC CGATGAATTT ATTGCAGGCC AAATGTATGG CTACGAATCG CAAGTGGCAC GTGGTGGTGA TAATTTCTCT TCTGGTCAAG GCCAACGCAT AACGCTTGCC CGCGCATTGC TCAAAAATGC ACCGATTTTA ATTCTCGATG AAGCTACCAG CAACTTGGAT AGCGCCACCG AACAAGGGTT TTTACAAGCC CTCGAAGATA ATCGGGCTGG TCGCACCACC GTGGTGATTG CCCATCGTCT GAGCACCATT TTACGGGCCG ACCGCATTTT GGTGATGGAG AATGGCGAAA TTATCGAATC GGGCAGCCAT GACCAACTTG TAGCCCAAGC TGGCCATTAT TACAACTTGA TCAAAGGCCA GATTACCAAG CCCGCTCCCG AGCCAATTGT CATGCCTGAA ACCCATTTGA ACCAACTCGC GGCCTAG
|
Protein sequence | MLSKTLIYRR LPWIRASEGR DCGPSVFASV AHFYKHRITL EQARSLVGAD RNGTSLAGLR DGGRAIGLES RPAQAIYSAL QHIQLPAIAH LKGGEGHYVV IHKWSPTSVV VLDPSRGLRT LSKAEFEAMW SGYLVEFKPT AALKPREIDV KPFKTLLQLA RQHKLMLGIV LFFALLATSL GWVTSFFMQI LIDSIIPNRD QALLFALGLG LILVSVFQST LQFGRLWLSA KVGQHVHQAY SAQYIDRLLR LPMKVFDVRC IPGLVLRVTQ ADGVQLALSE GLITILADVA MFVTALGIIA FYNPIAALIA AAALPLVWFV LFALNDRVYN AQLAAIIRME EFTSQMVDVF DCVRTIKVFG AEEQYKALLN EKLANFTKSR MDSRINIALP SAWSVLATSL ITASILWYGS SQVFAGRMTP GELVVLFGMV AFYLQPIQRL PATILNLRTA LLGIERMDEI TTLPDEASRT SEPIALAEVK GEIKFNDVHF AYMRNKMVLK KLNFEIKPGE TVAIVGETGS GKTSLANLIA GFYLPTHGDV LIDGISTRNI DPDELRRSIS AVFQNTRLLQ QSIRDNITLM RDTDLELIRN AAKIAQADEF IAGQMYGYES QVARGGDNFS SGQGQRITLA RALLKNAPIL ILDEATSNLD SATEQGFLQA LEDNRAGRTT VVIAHRLSTI LRADRILVME NGEIIESGSH DQLVAQAGHY YNLIKGQITK PAPEPIVMPE THLNQLAA
|
| |