Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3127 |
Symbol | |
ID | 5734999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3947286 |
End bp | 3949175 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280270 |
Product | amino acid adenylation domain-containing protein |
Protein accession | YP_001545892 |
Protein GI | 159899645 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1020] Non-ribosomal peptide synthetase modules and related proteins |
TIGRFAM ID | [TIGR01733] amino acid adenylation domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAATAG CGACATTAGC GCCGATCACC GCCGACCGCA CAGGCACAGT TTTTGTGCAT GATGTTATCA GCACGCACGC CCAGCATTCG CCACAGGCTA TTGCCATTGC TACCAGCACA TTCAAGTTGA GTTACGCTGA ATTCGAACAG CGCACCAATC AGCTTGCCCA TTATTTACAC CGCCAAGGAG TGCATCGTGG CCACACAGTT GGCGCGTGCT TTGAGCGCTC AGTCGAGGCA ATGATCGCCG CCGTGGCGAT TTGGAAGGCT GGCGCAGTCT ACTTGCCGCT TGATCCAGGC TATCCCCAAG AACGGCTTAA ATATATGTTG GGCAATAGTG GCGCGAGTTT GGTACTGGCA ACCCAACTCA CCGCCAGCCA GTTTCCTGAG CAACAATTGC ATATTTTTGA GCAATTAGCC GCTGAGCTTG CACAGCAACC AAGCCATGCC CCTGAACATC AACTTACACC CGACGATCTG GCCTATATCA TTTATACCTC TGGCTCAACT GGCAAGCCCA AAGGCGTGCT CGTGCCGCAT CGCGGTTTGG CCAATTTGGC GGCTGCTCAA ACCGAGCGCT TTGGCATCAA CAGCCAATCA CGAATTTTGC AGTTTGCCTC GCCAAGCTTC GATGCTTCGA TTTCTGAGAT GCTGACGGCT TTTTTTCAGG CTACAACTTT GTTTGTAGCC CCAACGAACG ATCTTTTGCC CGGCCCAGAC TTATTGACAA CCCTGCGTGA TCACCACATC ACGGTTGCCA CCCTGCCGCC TTCGGTGCTG GCGTTGCTCG ATCCACGCGA CTTGCCCAAT TTACAAACCA TCGTTTCGGC AGGCGAGGCT TGCACCGCTG AAATTGTGGC CCGTTGGGGG ACAAACCGCC GTTTTATCAA TGCCTATGGC CCGACCGAAG TCACCGTTTG CGCCACCATG AGCCAGTCTT TACGCTACGG TATGGCCGTC AGCATTGGCA ACGCTATCAG CAATAGCCAA ACCTATATCG TTGATGAGCA CTTAAATTTG GTCGAAGGCG AGGCGGTTGG CGAATTATTG GTCAGCAGTG TTGGCTTAGC CCATGGCTAC CTTGGCCTTG GCGATCAAAC TGCCGAGCGC TTTTTGCCCA ATCCATGGAG CGACCAAGCG GGCAGTCGGA TGTATCGCAC TGGCGATTTG GTGCGGCGCT TGAGCGATGG CAGCCTTGAG TTTCGCGGAC GCATTGATCA TCAAATTAAG CATCGCGGCT ATCGCATCGA TCCAGGCGAA ATTGAAATGC TCTTGATGGA ATATCCTAAT GTGCGCCACG CCGTCGTCAC GCTGCATCAC GACCATAACC AGACCGAGCG GTTGGTCTCG TATTTGGTGT TACACGGCGA AGTTATGCCC TACTATCGCG ATATTTACCG CTATTTGGAG AGCATGCTGC CCAAATATAT GGTACCGCTC TCGTATACGG TCGTGAAAGA ATTGCCACGC ACGCCCAATG GCAAACTCGA TTTAGCAGCC TTGCCTGAGC CAGATTTTGC CTTGCTGACC GTTAGTGAAA ATTATGTTGC CCCACGCACA CCGCTTGAAC AGCAGATCGC CGCGATTTGG GAAAATATTT TGGATACTCC CAATATTGGT GTGCTTGATG ATTTCTTTGA TGCTGGCGGC CACTCGCTGT TAGCAACCCA AATTGTTTCA GCGATTCGCA GCACGTTTGC GGTGGAAATT CCACTTTCAG TCTTGCTTGG CGTTGAGCCA ACCATCGCCG CCACGGCCCA ATTGATCGAG CAATATCAAA TTGCTAATGC CGATGATGCC GAACTCGCCG ACCTGTTGAA CGAACTTGAT GGCCTCTCCG ATGAAGAAAT TCAAGCCTTG TTAGCTGATG AAGGAGCGCT CACCGCATGA
|
Protein sequence | MSIATLAPIT ADRTGTVFVH DVISTHAQHS PQAIAIATST FKLSYAEFEQ RTNQLAHYLH RQGVHRGHTV GACFERSVEA MIAAVAIWKA GAVYLPLDPG YPQERLKYML GNSGASLVLA TQLTASQFPE QQLHIFEQLA AELAQQPSHA PEHQLTPDDL AYIIYTSGST GKPKGVLVPH RGLANLAAAQ TERFGINSQS RILQFASPSF DASISEMLTA FFQATTLFVA PTNDLLPGPD LLTTLRDHHI TVATLPPSVL ALLDPRDLPN LQTIVSAGEA CTAEIVARWG TNRRFINAYG PTEVTVCATM SQSLRYGMAV SIGNAISNSQ TYIVDEHLNL VEGEAVGELL VSSVGLAHGY LGLGDQTAER FLPNPWSDQA GSRMYRTGDL VRRLSDGSLE FRGRIDHQIK HRGYRIDPGE IEMLLMEYPN VRHAVVTLHH DHNQTERLVS YLVLHGEVMP YYRDIYRYLE SMLPKYMVPL SYTVVKELPR TPNGKLDLAA LPEPDFALLT VSENYVAPRT PLEQQIAAIW ENILDTPNIG VLDDFFDAGG HSLLATQIVS AIRSTFAVEI PLSVLLGVEP TIAATAQLIE QYQIANADDA ELADLLNELD GLSDEEIQAL LADEGALTA
|
| |