Gene Information |
Locus tag | Caul_4949 |
Symbol | |
ID | 5902411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5347471 |
End bp | 5348619 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641565469 |
Product | citrate synthase |
Protein accession | YP_001686567 |
Protein GI | 167648904 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | |
|
|
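The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal sketch of that relationship, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical, chosen for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 5347471
END_BP = 5348619


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming the CDS ends in one uncounted stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1149 382"
```

Under these assumptions the result agrees with the Gene Length (1149 bp) and Protein Length (382 aa) fields listed above.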
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.537298 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.392134 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGACT GGATCGACGC GGAACGGGCG ATGGCGACGC TGGGGGTGCG GGCCCAGACG CTGTACGCCT ATGTCAGCCG GGGCCGGGTG GCCGCCGCCG CCCATCCGGA CGATCCACGC CGCAGCCTCT ACCGCGCCTC CGATATCGCG GCCCTGGCGG CCAAGAAGGC CCGGGGGCGG CGCGCCGCCG ACGTGGCGGC CGAGGCCATC GCCTGGGGCG AGCCGGTCTT GCCGTCGGCG ATCACCACGG TGGTGGACGG ACGGCTCTAT TATCGCGGCC AGGACGCCGT CGATCTGGCC CGCATGCACA GCCTGGAGCA GGTGGCGCGG CTGCTGCGCG GCGGCCATGG CGCGCCGCTG ACCGGTCCGG AGCCGAAGAA ACCCAAGAAG GGCGAGACGC CGCGCGCCCG CCTGTTCTCG ACCCTGGCCG TGCGGGCCGG GACCGACCCG CCGGCCCGGG GCCGGGCGCC CCTGGCAATG GCCATGGAGG CCGCCGGCCT GCTGGAAGCC GCCGCCGACG CCGTGGCCGG GTCAATCGGC CAGGGTCCGA TCCATCGCCG GCTGGCCGAC GCCTGGAGCC TGGACGCTTC GGGCGCCGAC CTGGTCCGCC GCGCTTTGGT GCTGCTGGCC GACCACGAGC TGAACGCCTC CACCTTCGCG GTGCGGGTGG CGGCCTCGAC CGGCGCCTCG CTGGCGGCTG CGTCGCTGGC CGGTCTGGCG GCGCTGTCAG GCCCCCTGCA CGGCGGCATG GCGGCCCGGG TGGAGATGTT CGTCGACGAG GCCGAGCGGC GCGGCCCGAC CCGCGCGGTG GCCGAGCGCC TGGCCCAGGG CTCGGCCATG CCAGGCTTTG GCCACCCGCT CTATCCCGAC GGCGACCCCC GCGCGGCGGC CCTGCTGGAG GCGTTCAAGG TTCCGGAGGG GCTGGCGCAA TTGCGCGCCG AGACAGAGGC CGCCACGGGC CTGCGCCCCA ACATCGACTT CGCCCTGGTG GCCATGGCCC GCGCCCGCGC CCTCCCCGCC GACGCGCCCT TCAGCCTGTT CGCGGTGGGC CGGATGGCGG GCTGGCACGC CCACGCGATC GAGCAATTGC AGACGGGGCA GCTGATCCGG CCGAGGGCGC GGTATGTGGG CGTGCGGCCG GGCGCTTAG
|
Protein sequence | MADWIDAERA MATLGVRAQT LYAYVSRGRV AAAAHPDDPR RSLYRASDIA ALAAKKARGR RAADVAAEAI AWGEPVLPSA ITTVVDGRLY YRGQDAVDLA RMHSLEQVAR LLRGGHGAPL TGPEPKKPKK GETPRARLFS TLAVRAGTDP PARGRAPLAM AMEAAGLLEA AADAVAGSIG QGPIHRRLAD AWSLDASGAD LVRRALVLLA DHELNASTFA VRVAASTGAS LAAASLAGLA ALSGPLHGGM AARVEMFVDE AERRGPTRAV AERLAQGSAM PGFGHPLYPD GDPRAAALLE AFKVPEGLAQ LRAETEAATG LRPNIDFALV AMARARALPA DAPFSLFAVG RMAGWHAHAI EQLQTGQLIR PRARYVGVRP GA
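The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any analysis. The sketch below shows one way to strip the grouping and then compute a GC fraction and check for a terminal stop codon. It is an illustration only: the helper names `clean_sequence`, `gc_fraction`, and `has_terminal_stop` are hypothetical, the stop codon set (TAA, TAG, TGA) is the one used by translation table 11, and the demo uses only the first 30 bases of the gene sequence, so its result is not the whole-gene GC content.

```python
# Stop codons recognised under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def has_terminal_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First three groups of the gene sequence from the record (30 bases only).
prefix = clean_sequence("ATGGCCGACT GGATCGACGC GGAACGGGCG")
print(len(prefix), round(gc_fraction(prefix) * 100))
```

Pasting the full Gene sequence field into `clean_sequence` would allow the same functions to be compared against the Gene Length and GC content fields of the record.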