Gene Haur_3972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3972 
Symbol 
ID5735833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5050094 
End bp5053474 
Gene Length3381 bp 
Protein Length1126 aa 
Translation table11 
GC content65% 
IMG OID641281122 
Productamino acid adenylation domain-containing protein 
Protein accessionYP_001546732 
Protein GI159900485 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1020] Non-ribosomal peptide synthetase modules and related proteins 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.47287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACACAA AAGAAGAACT CTTGAAGCGG CGCGCCCAGC TTTCGGCCAA TCAGCGCACG 
ATGCTGGACA AGCAACTCCG TGGGGAAGCG ATGCTCCAGG CCGATCCAGG CTACGCCGCG
ATCGGCAAAC GGCCGGCCGA TGCGCCGACG CCGCTGTCTT TTTCCCAGGA GCGCCTCTGG
TTTTTGCAGC AGCTTGACCC CGCCAACGCC GCGTTCAATG GCCTGCGGCC CATGATGATC
GCCGGTCCGC TGGACATCGA TCTGATCAAC CGTGTCGCCA GGGAGATGTA CCGGCGCCAC
GAGATCCTGC GCACCAGCTT CCGCTTGATC GATGGTGCTC CATGCCAGAT CGTGGACCAG
GATCTGCCCC GCACCTACGC CGTGCCGGTC ATCGATCTGG GCCACCTGCC GCCCGGCGAC
GCCGAGGCCG AGGCACGCCG GCTGATGGCC GAGGAGGCGC GCACGCCGTT TGACATGGGC
CAGGCACCGA TGATGCGTCT CACTCTGATC AGCCTGGACG CCGCCGAGCA TGCGCTGCTG
CTGAGCCTTC ACCATATCGC CTATGATGAA TGGTCTAATC AGGTGCTGGT CGATGAGTTC
TCCGTGTTGT ATCGGGCCTT CGCCCAGGGC CAACCCTCGC CCTTGCCCGA GCTGCCGATC
CAGTACGGCG ATTACGCCTA CTGGCAGCGC GAGTGGTTGC AGGGATCGGT GCTGGAAGCC
CAGCTCGCCT ACTGGGCCGA GCAGCTCGGC GGCGCGCTGC CGACCCTGGA TCTGCCGACC
GACCATCCCC GGCCTGCGGT CCAGAACTTC CGCCTCCGTA CCGAGCAGAC CCTTCTGCCC
GCCGAGCTGG CCGCCGCCCT GCGGGCGATG AGCCAGAAGG AAGGCGTGAC CCTGTTTATG
GCCCTGCTAA GTGCCTTCAA AACCCTGCTT TTTTACTATA CAAATCAACC TGAGAGCATT
GTCGGCACCT TTATCGCCGG GCGGAGCCGG CCGGAACTTG AGCGACTGAT CGGCTTTTTT
GTTAATTCGC TGCCGCTGCG CAGCGACCTT ACCGGCGATC CCACCTTCTC CGAGGTGCTC
AAGCAGGTGC GCGCGGTGAC CCTGGGCGCC TACAACCACC AGGATGTGCC TATCGAGAAG
CTGATCGAGA CCTTCACGCC CAAGCGTGAC CTCAGCCGAA CGGCTATCTA CCAGGCGATG
TTTGTCTTGC AGAACGTTCC CAAGCCCGGT GATGCCGCCG AGCCGTCGGC CCTAGTGATC
CGCGAGTGGC AGGATGCCGA CGCTTCGGCC GGCGCCGACC TCCAATGCGA CATTACCCTG
ATGGTCTACG AGTTGCCGGA TGGCGGCCTG CGCTGCCAGT TCGAGTACGA CTCGTCACTG
TTCGAGGCGG CGACGATCCA ACGCATGCTC GCTCAATTCG AGACTCTGCT CGCGGCGGTC
TCCAGCAACC CCGGCCAGCG CCTCTCGCGT TTGTCCTTGC TGACCGACCA AGAGCGCCAG
CAGGTGCTCT ATGACTGGAA TGCCACTGCG GTGCCGTTCG CCGTGGACAG CTGCATCCAT
ACGCTGTTCG AGTCCCAGGC GGCCAGAGCA CCCCAGGCCA TCGCCCTAGT GCATGGCAAG
GAGCGTCTGA CCTACGGCGA GCTTAACCGG CGGGCCAACC AGCTGTCCCA CTACCTGCGC
ACGAGCGGAG TGGGCTCCGG GGGTTTTGTC GGGCTGGCCC TGGAGCGATC GGTGGAAATG
GTGGTCGCGG TGTTGGGGGT GCTCAAGGCC GGCGCCGCCT ATGTCCCACT CGACCCGACC
TATCCGGCGG CGCGCCTCCA GTTTATGCTC GCGGACGCCG ACGTCGGCTT CGTGCTGACC
ACGGGGCGCC TGCGTGATCG TCTGGCCGGC ACGGACCGCA CGCTCCTGGA GTGGGAAGCG
CTCGGAAACC TGGACGCGTA TCCGCCCGAC GATCCGCCGG CTCGCGCAAC TGCCGCAAGC
CCGGCCTATG TGATCTACAC CTCGGGATCA ACTGGCCAGC CCAAAGGAGT GGTCGTGCCC
CACGGCGCGC TCGTCCAGAC CTACCACACC TGGGAGTCAG CCTACGGCTT GGATGGCGCC
GTGCGCTGTC ATTTGCAGAT GGCAGCGTTC TCCTTCGATG TCTGCGCCGG CGATCTTATC
CGCGCGCTCG GCTCCGGGGG CACTCTCGTG ATCTGCCCGC GCGACACGCT GCTCGCCCCC
GCCGACCTGC ACGCGCTGAT CGTCGCCGAG GGCGTGGACT GCGCCGAATT CGTGCCGGCG
GTGTTGCGCG AGCTCGTCGC CTACCTGGAA GGCTCCGGCG GCGACCTGGG CTCTATGCGC
CTGCTGATCG CCGGTTCGGA TACCTGGTAC GGCGAAGAGT ATGCCCGGGT CGCCCGCCTC
TGCGGCCCTG ACACCCGCCT GGTCAATTCC TACGGCGTCA CCGAGGCCGT GATCGACAGC
ACCTACTTCG AGGCCGGCGC GGCTGCCGAA CTGCCCGCCA GACGCCAGGT GCCGATCGGC
CGGCCATTCG CGGCGACCCG GGCGTATGTG CTCAATCGGC TCGGTCAGCC GCAGCCGATC
GGCGTGCCCG GCGAACTCTA TCTCGGCGGC TCGCGGCTGG CGCTGGGCTA CTGGCGGCGC
CCCGGATTAA CCGCCGAGCG TTTTGTTCCC GATCCCTTCG CCGGGGAACC AGGCGCACGG
ATGTACCGCA CCGGCGATGC CGCCCGCTTC CGCGCCGACG GCACGATCGA GTTTCTCGGT
CGGATCGACC AGCAGGTCAA GTTGCACGGT GTCCGCATCG AACTGGGCGA GATCGAGGCC
ATCCTGCTCC AGCAGCCCGG CGTCATCCAG GCCGCCGCCG CCATCCGCGA AAATCAACTC
GGCCACCCCA TCCTGGTAGC CTACCTTGTG ACCGACGCGC TAGGCGATGA AGCGGCCTTG
CGGGCGGCTC TGCGCGAACG GCTGCCCGAG CATATGGTCC CTGCGGCGAC TATCATCCTC
CCGTCCCTGC CGCTAACTCC CAACGGCAAG ATCGACCGCC AAGCCCTGCC AGAGCCTGAC
CTTGGCAGAT CGGAGATGAG CGCCGCCTAC GAGGCGCCTG GGAGCCTGAT CGAGGAAACG
CTGGCCGAGA TCTGGGGCGA AGTCCTCAAG CGCGAACAGG TCGGCATTCG CGATAACTTC
TTCGAGCTTG GCGGCCACTC GTTGCTGATC ACCCAGGTGA TCTACCGGGC CAATGAGGCG
TTTGAGCTTA ATCTACCCCT ACGCAGCCTT TTTGAGGAGC CGACGATCGC GGATTTCGCC
CTGAGGGTCG AGGAAGCGCT CCTCGACAAA CTTGAGCAAC TCGACGATGA GGAAGCCCAG
CAGCTGATCG AGGGGTTGTA G
 
Protein sequence
MYTKEELLKR RAQLSANQRT MLDKQLRGEA MLQADPGYAA IGKRPADAPT PLSFSQERLW 
FLQQLDPANA AFNGLRPMMI AGPLDIDLIN RVAREMYRRH EILRTSFRLI DGAPCQIVDQ
DLPRTYAVPV IDLGHLPPGD AEAEARRLMA EEARTPFDMG QAPMMRLTLI SLDAAEHALL
LSLHHIAYDE WSNQVLVDEF SVLYRAFAQG QPSPLPELPI QYGDYAYWQR EWLQGSVLEA
QLAYWAEQLG GALPTLDLPT DHPRPAVQNF RLRTEQTLLP AELAAALRAM SQKEGVTLFM
ALLSAFKTLL FYYTNQPESI VGTFIAGRSR PELERLIGFF VNSLPLRSDL TGDPTFSEVL
KQVRAVTLGA YNHQDVPIEK LIETFTPKRD LSRTAIYQAM FVLQNVPKPG DAAEPSALVI
REWQDADASA GADLQCDITL MVYELPDGGL RCQFEYDSSL FEAATIQRML AQFETLLAAV
SSNPGQRLSR LSLLTDQERQ QVLYDWNATA VPFAVDSCIH TLFESQAARA PQAIALVHGK
ERLTYGELNR RANQLSHYLR TSGVGSGGFV GLALERSVEM VVAVLGVLKA GAAYVPLDPT
YPAARLQFML ADADVGFVLT TGRLRDRLAG TDRTLLEWEA LGNLDAYPPD DPPARATAAS
PAYVIYTSGS TGQPKGVVVP HGALVQTYHT WESAYGLDGA VRCHLQMAAF SFDVCAGDLI
RALGSGGTLV ICPRDTLLAP ADLHALIVAE GVDCAEFVPA VLRELVAYLE GSGGDLGSMR
LLIAGSDTWY GEEYARVARL CGPDTRLVNS YGVTEAVIDS TYFEAGAAAE LPARRQVPIG
RPFAATRAYV LNRLGQPQPI GVPGELYLGG SRLALGYWRR PGLTAERFVP DPFAGEPGAR
MYRTGDAARF RADGTIEFLG RIDQQVKLHG VRIELGEIEA ILLQQPGVIQ AAAAIRENQL
GHPILVAYLV TDALGDEAAL RAALRERLPE HMVPAATIIL PSLPLTPNGK IDRQALPEPD
LGRSEMSAAY EAPGSLIEET LAEIWGEVLK REQVGIRDNF FELGGHSLLI TQVIYRANEA
FELNLPLRSL FEEPTIADFA LRVEEALLDK LEQLDDEEAQ QLIEGL