Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_4604 |
Symbol | |
ID | 5902066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4979849 |
End bp | 4981327 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641565123 |
Product | AMP-dependent synthetase and ligase |
Protein accession | YP_001686222 |
Protein GI | 167648559 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.37521 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCACG ATGGTGTCGC GCTAAGTTCC ATTCTAACCC ACCATGCGCG TCGATCCCCC TCTCGCACGG CGCTGATCGT CGATGGCGTT CGCGTCGCCT ATGACGAACT GGACGCGCGC ACAAACCGTC GCGCCAGGAT GCTGGCGGCG CATGGCGTAG GCCATGGCGA CTTCGTCACG GTCGCGCTTC CCAATGGCCT GGAATTCTAC GAGACCACCT TCGCGCTCTG GAAACTCGGG GCGATCCCCA ACATCGTCGC CGCCAAGCTC CCGCGCCTCG AAATGGAGGC GATCCTCGAC ATCGTTCGCC CCAGGCTGTT TGTCGGCGTC CCGCCCGGGG GCGACGTTCC GGCCCTGGCC GAAGGTCAGG CGGAGTTGCA CCGATATTCG ACCGATCCGC TGCCGGAAGT TATCTCGCCG CACTGGAAAG CGATGACGAG CGGCGGCTCT ACCGGCCGGC CGAAGGTGAT CGTGGACGCC ATGCCGGCGC GGTGGAATCC GCAGGAGGGC TTCCTGGGCC AGCGTCCTGG CGACGTGATC CTCAATCCCG GGCCGCTCTA TCACAACGCA CCGTTCCACT GCGTCCACAT GGGTCTGTTC GTCGGCGCCA CGATCGTCGA GATGGGCAAG TTCGATGCGC TCGCCGCGCT CGAACTGATC GACGCCCATC AGGTCAATTG GGTGACCATG GTGCCGACGA TGATGCACCG CGTGTGGCGC CTGGACCCAG AGGTTCGGTC CCGCTTCACG CTGCCCAGTC TGCGCATGAT GCTCCACATG GCCGCCCCCT GCCCCGCCTG GCTCAAGGAG GCCTGGATCG GCTGGCTGGG CGGCGAGCGG GTGTGGGAAT ATTACGGCAC GACCGAAGGG ACGGGATCGA CGATGATTTC CGGCACGGAC TGGTTGGCTC ATCCAGGTTC GGTGGGGCGC GTCCGTGAGG GCTATGCGCT GAAGATTCTC GACGAGACGG GGCGGGAGCG ACCGATCGGC GAGGTCGGCG AGGTCTATTT CCGCCCAGAG GGCGGCGCGG GATCGACCTA CCACTATCTG GGAAGCACGC CCCGGCGGGT CGGCGAATGG GAGACGCCCG GGGACCTGGG GCATGTGGAC GAGGACGGCT ATCTCTATCT TTCCGACCGT CGCAACGACC TGATCATCTC CGGCGGCGCG AACATCTACC CGGCCGAGGT CGAGGCCGCG ATCGACGCGC ATCCGGCCGT TCGGACCAGC GCGGTGATCG GGCTTCCGGA CGAGGAGTGG GGCGCGCGCG TCCATGCGAT CGTCCAGCCG ATCGAGGACT CAGGCCTGGA GGAGGCGGAG CTTCTCGCGT TCGTCGCCGA CCGGCTGGCG CGCTTCAAGC TGCCCAAGAG CGTCGAGTTC ACGCGTGACC CCTTGCGGGA CGAGGCTGGA AAGGTCCGTC GGACCGCGCT GCGCGACGCT CGATTGGGCG GAGGGGCGGG GCAGGTTGTC CCAGCCTAG
|
Protein sequence | MSHDGVALSS ILTHHARRSP SRTALIVDGV RVAYDELDAR TNRRARMLAA HGVGHGDFVT VALPNGLEFY ETTFALWKLG AIPNIVAAKL PRLEMEAILD IVRPRLFVGV PPGGDVPALA EGQAELHRYS TDPLPEVISP HWKAMTSGGS TGRPKVIVDA MPARWNPQEG FLGQRPGDVI LNPGPLYHNA PFHCVHMGLF VGATIVEMGK FDALAALELI DAHQVNWVTM VPTMMHRVWR LDPEVRSRFT LPSLRMMLHM AAPCPAWLKE AWIGWLGGER VWEYYGTTEG TGSTMISGTD WLAHPGSVGR VREGYALKIL DETGRERPIG EVGEVYFRPE GGAGSTYHYL GSTPRRVGEW ETPGDLGHVD EDGYLYLSDR RNDLIISGGA NIYPAEVEAA IDAHPAVRTS AVIGLPDEEW GARVHAIVQP IEDSGLEEAE LLAFVADRLA RFKLPKSVEF TRDPLRDEAG KVRRTALRDA RLGGGAGQVV PA
|
| |