Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2787 |
Symbol | |
ID | 5900242 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3025391 |
End bp | 3026671 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641563279 |
Product | citrate synthase I |
Protein accession | YP_001684412 |
Protein GI | 167646749 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | [TIGR01798] citrate synthase I (hexameric type) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.420339 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.059507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA AAGCCACGCT GACGATCGGC GACAAGAGCT ACGACCTGCC GATCCTCAAG GGCAGCACGG GACCCGACGT CGTCGACGTG CGGAAGTTCT ATGGCGACTC CGACCATTTC ACTTTCGATC CGGCCTTCAC CTCGACCGCA TCCTGCGAAA GCAAGATCAC CTATATCGAT GGCGACGCCG GCGTTTTGCT GCACCGCGGC TATCCGATCG GCCAACTGGC CGAGCAGTCC AGCTTCCTGG AAGTCTGCCA CCTGCTGCTG AACGGCGAAC TGCCGACGGC CGACGAATTC ACCAAGTTCG AACGCAACAT CACCTACCAC ACGATGCTGC ACGCCCAGTT CGACAGCTTC TTCCAGGGCT TCCGCCGCGA CGCCCACCCG ATGGCGGTGA TGACCGGCGC GGTCGGCGCC CTGTCGGCCT TCTATTCCGA CAGCCTGAAC GTCGATGATC CCAAGCAGCG CGAGATCAGC GCTCACCGCC TGATCGCCAA GATGCCGACC ATCGCCGCGC GCGCCTTCCA GTACTCGCAA GGCCGTCCGT TCGTGACGCC GCGCAACGAG CTGAGCTACT CGGAAAACTT CCTGCGCATG TGCTTCTCGG TGCCGGCCGA GGATTGGGTC CCCAACCCCA TCCTGACCCG CGCCATGGAT CGCATCTTCA TCCTGCACGC CGACCACGAG CAGAACGCCT CGACCTCGAC CGTCCGTCTG GCCGGCTCGT CGGGCGCCCA CCCGTTCGCC TGTATCGCCG CCGGCATCGC CTGCCTCTGG GGTCCGTCGC ACGGCGGCGC CAACCAGGAA GCCCTGGAGA TGCTGGAAGA GATCGGCACG GTCGAGAACA TCCCCGCCTA TGTGCAGGGC GTGAAAGACC GCAAGTACAA GCTGATGGGC TTTGGCCACC GGGTGTACAA GAACTTCGAC CCCCGGGCGA CGGTCATGCA GAAGACCTGC TACGAGGTTC TGGAGCAGTT GGGCATCGAC GACCCGCTGC TGCAAGTGGC CATGGAGCTG GAAAAGGTCG CGCTGAGCGA TCCCTACTTC ATCGATCGCA AGCTCTATCC GAACATCGAC TTCTATTCGG GCATCACCCT GCGCGCGATG GGTTTCCCGA AAGAGATGTT CACGGTGCTG TTCGCCCTGG CTCGCACCGT CGGCTGGATC AGCCAGTGGA AGGAAATGTT CGAGGACCCC AACCGCAAGA TCGGCCGCCC GCGCCAGCTC TATACGGGCG CCACGCAACG CGACTACGTG TCGGTCGACA AGCGCGGCTA A
|
Protein sequence | MTDKATLTIG DKSYDLPILK GSTGPDVVDV RKFYGDSDHF TFDPAFTSTA SCESKITYID GDAGVLLHRG YPIGQLAEQS SFLEVCHLLL NGELPTADEF TKFERNITYH TMLHAQFDSF FQGFRRDAHP MAVMTGAVGA LSAFYSDSLN VDDPKQREIS AHRLIAKMPT IAARAFQYSQ GRPFVTPRNE LSYSENFLRM CFSVPAEDWV PNPILTRAMD RIFILHADHE QNASTSTVRL AGSSGAHPFA CIAAGIACLW GPSHGGANQE ALEMLEEIGT VENIPAYVQG VKDRKYKLMG FGHRVYKNFD PRATVMQKTC YEVLEQLGID DPLLQVAMEL EKVALSDPYF IDRKLYPNID FYSGITLRAM GFPKEMFTVL FALARTVGWI SQWKEMFEDP NRKIGRPRQL YTGATQRDYV SVDKRG
|
| |