Gene Information |
Locus tag | Caul_3540 |
Symbol | |
ID | 5900995 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3816667 |
End bp | 3818457 |
Gene Length | 1791 bp |
Protein Length | 596 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641564047 |
Product | long-chain-acyl-CoA synthetase |
Protein accession | YP_001685165 |
Protein GI | 167647502 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.498772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTTGC CGGCCAGGTT GAAGCGTGAA CTGCGGTTCC TGAAGGGGCT TTCGCGCACC CTGAAACGCG TGAAGTCCAT CGCGCCAGAC AGCACCAACC TGATCTGCGA CGACCTGGAA GCCGCCGTCG ATAAGTGGCG CGACAGCCAG GCCATCGTCT TCGAAGGGCG CTCGGTCACC TATGCCGAGT TGGACGCCAT CGCCAACCGC TACGCCCACT GGGCCAAGGG GCAGGGGATC ACCCGCGGCC AGACCGTGGC GCTGTTCATG CCCAACAGGC TGGAATACGT GGCGATCTGG TACGGACTGT CGAAGGTCGG GGTGGCCACG GCCCTGATCA ACAACCAGCT GACCGGCCCG GCCCTGGCTC ACTGCCTGAA CATCTCCCAG GCCCTGCACT GCATCGTCGA TCCGGAGACC TCGTCCTGTT TCGAGCAGGT GAAGGGATCG CTGGAGCGCC ACGTCCAGCA GTGGGTGTTG GGTCCCGCCT ACGGCGACCA GCGCGATCTG GTCAACGCGC TCAAGAGCTG CAGCCAACTG CGGCCGGATC GCCTGACGGC CCGTAACGGC CTGACCGCGC GCGACACCGC CCTCTATATT TTCACCAGCG GCACCACGGG CCTGCCCAAG GCGGCGCGCA TCACTCACAT GCGGGCCCAG CTCTACATGC GTGGTTTCGC CGGCTCGACC GATGCACGCC ACACCGACCG CATCTACATC ACCCTGCCGC TCTATCACGC CACCGGCGGC CTGTGCGCGG TGGGCGCGGC CCTGCTGAAC GGCGGAACAG TGGTGCTGCG CAAGAAGTTC TCGGTCAGCG CCTTCTGGGA CGACGTGGTC GCCGAGAACT GCACGATGTT CGTCTATATC GGCGAGCTGT GCCGCTACCT GGCCAACCAC CCGGAGGGTC CCAACGAGCG GGCGCACAAG ATCCGGCTGA TCTTCGGCAA CGGCCTGCGT CCCGACGTCT GGGACGTGAT GCTCGACCGC TTCAAGGTCG GCGGGGTGCT GGAGTTCTAC GGCGCGACCG AGGGCAATGT CTCGCTGTTC AACTTCGATG GCCGGCGCGG GGCCATCGGC CGGGTGCCAG CCTATCTGAA GAAGAAGTTC AACATCCGCA TCGTCAAGTT CGACGTCGAG ACCGAAACCC CGGTCCGGGC CGCCAACGGC TGCTGCATCG AGGCCGCGCC CGGCGAGATC GGCGAGTGCA TCGGCCACAT CGCCAGCGAC GCCCGCTCCA ACTTCACCGG CTACGCCGAC AAGGCCGCCA CCGAGAAGAA GATCCTGCAC GACGTCTTCG AGAAGGGCGA CGCCTGGTTC CGCACCGGCG ACCTGATGCG CGCCGACAGC GACGGCTATC TCTATTTCAT CGACCGGATC GGCGACACCT TCCGCTGGAA GGGCGAGAAC GTCGCCACCA GCGAGGTGTC CGAGCGCCTG TCCGCGGTGC CCGGCGTCAA GGAGGTCAAT GTGTACGGCG TGCCAATCGG CGATCTGGAC GGCAAGGCCG GCATGGCGGC GCTGGTGGTC GACGGCACGT TCGAGATCGC GGCCCTGGCC GAATATGTCG ATCGCGAGCT GCCTGTCTAC GCCCGTCCGA TCTTCGTGCG CCTGCAGCCC GAGATCGAGA CGACAGGCAC CTTCAAGTAT CGCAAGATCG ACCTGGTGAA GGAGGGTTTC GATCCCGCCA ACACCCGGGA CCCATTGTAT TTCCGCGACC CGGCCAAGGG CTATGTGAAG CTGACCAAGG CGGTCCACGC CAAGATCCTG GCGGGCGGCT ACAGGCTTTA G
|
Protein sequence | MSLPARLKRE LRFLKGLSRT LKRVKSIAPD STNLICDDLE AAVDKWRDSQ AIVFEGRSVT YAELDAIANR YAHWAKGQGI TRGQTVALFM PNRLEYVAIW YGLSKVGVAT ALINNQLTGP ALAHCLNISQ ALHCIVDPET SSCFEQVKGS LERHVQQWVL GPAYGDQRDL VNALKSCSQL RPDRLTARNG LTARDTALYI FTSGTTGLPK AARITHMRAQ LYMRGFAGST DARHTDRIYI TLPLYHATGG LCAVGAALLN GGTVVLRKKF SVSAFWDDVV AENCTMFVYI GELCRYLANH PEGPNERAHK IRLIFGNGLR PDVWDVMLDR FKVGGVLEFY GATEGNVSLF NFDGRRGAIG RVPAYLKKKF NIRIVKFDVE TETPVRAANG CCIEAAPGEI GECIGHIASD ARSNFTGYAD KAATEKKILH DVFEKGDAWF RTGDLMRADS DGYLYFIDRI GDTFRWKGEN VATSEVSERL SAVPGVKEVN VYGVPIGDLD GKAGMAALVV DGTFEIAALA EYVDRELPVY ARPIFVRLQP EIETTGTFKY RKIDLVKEGF DPANTRDPLY FRDPAKGYVK LTKAVHAKIL AGGYRL
|
| |
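The length fields in this record can be cross-checked against each other: the coordinates give 3818457 - 3816667 + 1 = 1791 bp, and 1791 / 3 = 597 codons, which is 596 amino acids once the final stop codon (TAG here) is excluded. A minimal sketch of that check follows. The function name `summarize_cds` and its return keys are hypothetical, not part of any database API, and it assumes the sequence is given in coding orientation (as the gene sequence above is, despite the `-` strand) and ends in a stop codon.

```python
# Hypothetical helper: consistency check for a CDS record like the one above.
# Assumes the sequence is already in coding orientation and ends with a stop codon.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def summarize_cds(spaced_seq: str, start_bp: int, end_bp: int) -> dict:
    """Summarize a CDS given as space-grouped nucleotide blocks plus 1-based coordinates."""
    # Remove the block spacing ("ATGAGTTTGC CGGCCAGGTT ...") and normalize case.
    seq = "".join(spaced_seq.split()).upper()
    gc_count = sum(base in "GC" for base in seq)
    codons = len(seq) // 3
    return {
        "length_bp": len(seq),                  # bases actually present in the sequence
        "span_bp": end_bp - start_bp + 1,       # inclusive coordinate span
        "protein_aa": max(codons - 1, 0),       # codons minus the assumed stop codon
        "gc_percent": round(100 * gc_count / len(seq)) if seq else 0,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


if __name__ == "__main__":
    # Toy input, not the real gene: three codons, the last one a stop.
    print(summarize_cds("ATGAAA TAG", 1, 9))
```

Passing the full gene sequence with `start_bp=3816667` and `end_bp=3818457` should yield matching `length_bp` and `span_bp` values if the record is internally consistent; a mismatch would point to a truncated sequence or wrong coordinates.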