Gene Haur_1828 details


Gene Information

Locus tag: Haur_1828
Symbol:
ID: 5733716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2122807
End bp: 2125023
Gene Length: 2217 bp
Protein Length: 738 aa
Translation table: 11
GC content: 47%
IMG OID: 641278971
Product: ABC transporter related
Protein accession: YP_001544599
Protein GI: 159898352
COG category: [V] Defense mechanisms
COG ID: [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
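
The coordinates and lengths in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming that the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both are assumptions about this record's conventions, not something the page states. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2122807
end_bp = 2125023

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 2217 738
```

Under these assumptions the result matches the listed Gene Length (2217 bp) and Protein Length (738 aa).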


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0155639
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCAGTA AGACGTTAAT TTACCGCCGC TTACCATGGA TTCGTGCTTC AGAAGGCCGC 
GATTGCGGCC CCTCGGTTTT TGCATCAGTA GCGCATTTCT ACAAGCATCG AATTACCTTG
GAGCAAGCAC GGTCTTTGGT TGGAGCCGAT CGCAATGGTA CAAGTTTGGC AGGTTTGCGT
GATGGAGGCC GCGCAATTGG GCTTGAATCA CGGCCTGCTC AAGCGATTTA TAGCGCATTG
CAACATATCC AATTGCCAGC GATTGCCCAC CTCAAGGGTG GCGAAGGCCA TTATGTGGTT
ATCCATAAAT GGTCGCCAAC CTCAGTGGTT GTGCTAGACC CTAGCCGTGG TCTGCGCACG
CTCAGCAAAG CTGAATTTGA GGCCATGTGG AGTGGTTATT TAGTTGAATT TAAGCCTACC
GCAGCGCTCA AACCTCGCGA GATCGATGTT AAGCCCTTCA AAACCCTGTT GCAGCTCGCC
CGCCAACATA AACTTATGTT AGGAATTGTG CTCTTCTTCG CCTTGCTAGC AACCTCGTTA
GGCTGGGTAA CCTCGTTTTT TATGCAAATA TTGATCGATT CGATTATCCC AAATCGTGAT
CAAGCATTAC TTTTTGCCTT AGGTCTAGGG TTAATTTTGG TCAGTGTTTT CCAATCGACA
CTCCAATTTG GGCGTTTATG GCTTAGTGCC AAGGTTGGGC AGCATGTCCA CCAAGCCTAT
TCAGCTCAGT ATATTGATCG TCTGTTACGC CTACCAATGA AAGTATTTGA TGTTCGTTGT
ATCCCTGGAC TGGTGTTGCG GGTTACCCAA GCCGATGGGG TGCAATTAGC GCTTTCTGAA
GGATTGATTA CGATTTTGGC CGATGTAGCA ATGTTTGTAA CTGCATTAGG CATTATCGCA
TTCTATAACC CAATTGCGGC CTTGATCGCA GCAGCTGCCT TACCACTTGT TTGGTTTGTC
TTATTCGCGT TGAATGATCG GGTGTATAAC GCTCAATTAG CCGCAATCAT TCGGATGGAA
GAATTCACCT CGCAGATGGT CGATGTATTT GATTGTGTAC GAACAATCAA GGTTTTTGGC
GCAGAAGAAC AATATAAAGC TTTACTCAAC GAAAAATTGG CTAATTTCAC AAAATCTCGT
ATGGATAGCC GAATTAATAT CGCCCTGCCC AGCGCATGGA GCGTTTTAGC AACCTCATTA
ATTACTGCTT CAATTTTATG GTACGGCAGT AGCCAAGTAT TTGCTGGGCG CATGACTCCA
GGTGAGTTAG TTGTTTTGTT TGGAATGGTC GCATTTTATC TCCAACCAAT CCAGCGTTTA
CCTGCCACAA TTCTCAATCT GCGCACAGCG TTATTGGGCA TTGAACGGAT GGATGAAATT
ACCACCTTAC CAGACGAAGC TTCACGAACC AGCGAACCAA TCGCATTAGC TGAAGTCAAG
GGCGAAATTA AGTTCAATGA TGTGCATTTT GCCTATATGC GCAACAAAAT GGTGCTAAAA
AAGCTCAATT TTGAAATCAA GCCTGGCGAA ACTGTAGCAA TTGTTGGTGA AACTGGCTCA
GGTAAAACCT CGTTGGCTAA TTTGATCGCA GGTTTTTATC TCCCAACCCA CGGCGATGTA
TTAATTGATG GCATTAGCAC GCGCAATATC GACCCCGATG AATTACGACG CTCAATTAGT
GCCGTCTTTC AAAATACGCG GCTGCTGCAA CAATCAATTC GCGATAACAT CACCCTGATG
CGTGATACCG ATTTAGAATT AATTCGCAAT GCCGCCAAAA TTGCTCAGGC CGATGAATTT
ATTGCAGGCC AAATGTATGG CTACGAATCG CAAGTGGCAC GTGGTGGTGA TAATTTCTCT
TCTGGTCAAG GCCAACGCAT AACGCTTGCC CGCGCATTGC TCAAAAATGC ACCGATTTTA
ATTCTCGATG AAGCTACCAG CAACTTGGAT AGCGCCACCG AACAAGGGTT TTTACAAGCC
CTCGAAGATA ATCGGGCTGG TCGCACCACC GTGGTGATTG CCCATCGTCT GAGCACCATT
TTACGGGCCG ACCGCATTTT GGTGATGGAG AATGGCGAAA TTATCGAATC GGGCAGCCAT
GACCAACTTG TAGCCCAAGC TGGCCATTAT TACAACTTGA TCAAAGGCCA GATTACCAAG
CCCGCTCCCG AGCCAATTGT CATGCCTGAA ACCCATTTGA ACCAACTCGC GGCCTAG
 
Protein sequence
MLSKTLIYRR LPWIRASEGR DCGPSVFASV AHFYKHRITL EQARSLVGAD RNGTSLAGLR 
DGGRAIGLES RPAQAIYSAL QHIQLPAIAH LKGGEGHYVV IHKWSPTSVV VLDPSRGLRT
LSKAEFEAMW SGYLVEFKPT AALKPREIDV KPFKTLLQLA RQHKLMLGIV LFFALLATSL
GWVTSFFMQI LIDSIIPNRD QALLFALGLG LILVSVFQST LQFGRLWLSA KVGQHVHQAY
SAQYIDRLLR LPMKVFDVRC IPGLVLRVTQ ADGVQLALSE GLITILADVA MFVTALGIIA
FYNPIAALIA AAALPLVWFV LFALNDRVYN AQLAAIIRME EFTSQMVDVF DCVRTIKVFG
AEEQYKALLN EKLANFTKSR MDSRINIALP SAWSVLATSL ITASILWYGS SQVFAGRMTP
GELVVLFGMV AFYLQPIQRL PATILNLRTA LLGIERMDEI TTLPDEASRT SEPIALAEVK
GEIKFNDVHF AYMRNKMVLK KLNFEIKPGE TVAIVGETGS GKTSLANLIA GFYLPTHGDV
LIDGISTRNI DPDELRRSIS AVFQNTRLLQ QSIRDNITLM RDTDLELIRN AAKIAQADEF
IAGQMYGYES QVARGGDNFS SGQGQRITLA RALLKNAPIL ILDEATSNLD SATEQGFLQA
LEDNRAGRTT VVIAHRLSTI LRADRILVME NGEIIESGSH DQLVAQAGHY YNLIKGQITK
PAPEPIVMPE THLNQLAA
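
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The sketch below shows one possible way to do that in Python and to compute a GC fraction. The helper names `clean_sequence` and `gc_content` are hypothetical and are not part of any database tool. Only the first line of the gene sequence is used here to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTCAGTA AGACGTTAAT TTACCGCCGC TTACCATGGA TTCGTGCTTC AGAAGGCCGC"
seq = clean_sequence(first_line)

print(len(seq), seq[:3], gc_content(seq))  # prints: 60 ATG 0.5
```

The 0.5 here applies only to these first 60 bases. Running the same helpers on the complete gene sequence would give the whole-gene figure, which the record lists as 47%.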