Gene Information |
Locus tag | Haur_3729 |
Symbol | |
ID | 5735593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4688319 |
End bp | 4690103 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280881 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001546493 |
Protein GI | 159900246 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1022] Long-chain acyl-CoA synthetases (AMP-forming) |
TIGRFAM ID | |
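Note: the coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single untranslated stop codon; neither assumption is stated in the record itself, and the variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4688319
end_bp = 4690103
protein_length_aa = 594

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS whose final codon is a stop codon
# that does not contribute an amino acid to the protein.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(gene_length_bp)           # 1785
print(expected_protein_length)  # 594
```

Under these assumptions the computed values agree with the Gene Length (1785 bp) and Protein Length (594 aa) fields.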
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTTC GTAATCTGGG TGAGTTGTTT CGCGAGCGGT CACAAGCCTT TGCCCATCTC AATCGCTGGC GAACTCGCCG CAATGGCGAA TGGATTACCT GCACCAACGC CGAACATCAA CGCCATGTCT ATCAGTTGAT GGCAGGTTTT CAACAGTTGG GGTTGCAAAA AGGCGATCGC GTGGGCATTA TGTCCAATAC CAGCGTTGAT TGGGTCGAAT CGGATTGGGC TTTGGTCTGT TCAGGCGCTG TGCCAGTTTC AATTTATCCC TCGTTGATGG CCGATACGGT GGCCTTTATT GCCCAAGATG CCGACCTCAA ATTTTTGTTG ATCGAAAATC GCGAGCAATA CGATAAATTG CAAAAGGTCC GCTCGCAGCT TGAGCATATC GAACGGGTAA TTATTTTTGA TGGCCGGGAT TTGCCCAGTG ATGACCCATG GATTTTATCG CTAACCAGTT TGCGGCGCAT GGCCACCAGC GATGCAACCG CCCAAGAGGT TTTTGCCACC AACTGCGCCC AACAGATCGA GCCAGAAGAT TTGGCGACAA TTGTCTATAC CTCGGGCACA ACTGGCAATC CCAAAGGCGC AATGTTGGCG CATCGCGCCT TGCTTGGCGA ACTAACCGCG ATTCGCACGA CTATGGCCAT GCAAGCTGGT GATGATGATG TGTTATTTTT GCCAGCCGCG CATATTTTTG GCCGCTTGCA ACATATGTGT GGGGTCGATA ATGGCCTTAA CACCGCAATC ATCGAATCGA TCAAACAAGT GCTTGAAGAT GTGCAAGCAA TCAAACCAAC CTTTTTCTTC AGCGTCCCGC GCATGTACGA AAAGATTTTC AGCACAGCCC AGGCTCGCGC CGAAGCCAGC CCAATTCGCA AACGGATTTT TGCCTGGGCG CTGGCGATCG CCCGTCAAAT GAGCCGCTAC AAAGGCCAAA AAGCAGCGGT TCCGGCAGCA TTCAAGCTGA AATATGCCTT GGCTGATCGC TTAGTGTTTA AGAAGGTTCG TGCCTTACTT GGCGGCAATA TTCGCTATGC AATTACTGGC GGCGCTCCGC TCGATATCGA AATTTTGGAA TTTTTCAATG GTGCGGGTGT GCTGTTACTC GAAGGTTGGG GCTTGACCGA AACATCTGCT GCGGTAACCG CCAATCGCCC TGATGATTAT CGGCTAGGCA CGGTTGGCAA GGTGTTTCCT GGCAACGAAA TCAAAATCGC TGACGATGGC GAAGTGCTGG TGCGCGGTAA TCTAATTCTA AGTGGCTACT ATAATAATCC GCAAAAAACC AACGAAGCCT TGATCGATGG CTGGTTTCAC ACCGGCGACA TCGGCAAAAT TGATGCTGAT GGCTTCTTGA GCATTGTTGA TCGCAAAAAA GATTTGCTAA TTACCGCGTC GGGCAAAAAT ATTGCGCCGC AAGCCGTCGA AGCTGCCTTC AAAAATAGCC CGTACATCTC GCAGTGTGCC GTCTTTGGTG ATCGCCGACC CTATTTGGTG GCATTGTTCA CGCTCGATAT GGAAGCCGTC ACCGCTTGGG CTAATCGTGA GCATGTGCCA GTTGATGCTA ATCTGCATAA GCATCCTAAA TTAGTTGCTG CGATTGAACA CGAAGTGCAA ACAATCAACC CAACCTTGCC TTCATTCGAG CAAATTAAGG CCTATGAAAT TTTGCCCGAA GATTTTACCA TTGAAAACGA TTTGCTCACG CCAACGCTCA AAATTCGTCG CCGCCAAATC TACGAACGCT TCGCCAAGAG TTTTGAGCAA TTGTACAAAC GTTAG
Protein sequence | MSFRNLGELF RERSQAFAHL NRWRTRRNGE WITCTNAEHQ RHVYQLMAGF QQLGLQKGDR VGIMSNTSVD WVESDWALVC SGAVPVSIYP SLMADTVAFI AQDADLKFLL IENREQYDKL QKVRSQLEHI ERVIIFDGRD LPSDDPWILS LTSLRRMATS DATAQEVFAT NCAQQIEPED LATIVYTSGT TGNPKGAMLA HRALLGELTA IRTTMAMQAG DDDVLFLPAA HIFGRLQHMC GVDNGLNTAI IESIKQVLED VQAIKPTFFF SVPRMYEKIF STAQARAEAS PIRKRIFAWA LAIARQMSRY KGQKAAVPAA FKLKYALADR LVFKKVRALL GGNIRYAITG GAPLDIEILE FFNGAGVLLL EGWGLTETSA AVTANRPDDY RLGTVGKVFP GNEIKIADDG EVLVRGNLIL SGYYNNPQKT NEALIDGWFH TGDIGKIDAD GFLSIVDRKK DLLITASGKN IAPQAVEAAF KNSPYISQCA VFGDRRPYLV ALFTLDMEAV TAWANREHVP VDANLHKHPK LVAAIEHEVQ TINPTLPSFE QIKAYEILPE DFTIENDLLT PTLKIRRRQI YERFAKSFEQ LYKR
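Note: the sequences above are printed in space-separated blocks of 10 letters. The following is a hedged sketch of how the Gene sequence field could be cleaned and compared with the Gene Length and GC content fields. The helper names (`clean_sequence`, `gc_fraction`, `ends_with_stop`) are hypothetical and not part of any database tool; the stop codon set is the one used by translation table 11.

```python
# Stop codons of translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove whitespace (the 10-letter grouping) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration on the first two blocks of the Gene sequence field only.
prefix = clean_sequence("ATGTCATTTC GTAATCTGGG")
print(len(prefix), gc_fraction(prefix))
```

Passing the full Gene sequence field to `clean_sequence` gives a string whose length and `gc_fraction` can be compared with the Gene Length and GC content fields, and `ends_with_stop` checks that the sequence closes on a stop codon.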