Gene Information |
Locus tag | Caul_2071 |
Symbol | |
ID | 5899526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2210873 |
End bp | 2213884 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562560 |
Product | SMP-30/gluconolactonase/LRE domain-containing protein |
Protein accession | YP_001683697 |
Protein GI | 167646034 |
COG category | |
COG ID | |
TIGRFAM ID | |
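
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2210873
end_bp = 2213884


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length = gene_length(start_bp, end_bp)
print(length)                  # 3012
print(protein_length(length))  # 1003
```

Both results match the Gene Length (3012 bp) and Protein Length (1003 aa) fields.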
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGCGA TCCTATTGTC GGCGATCCTG GCGGCCTCGA CCCCCGCCGT GGCGGCGACC GCCTCCAACT CCGTCCTGCT GACGGCGCCG GACGATCCGC GCGCGATCAT GGTGAAAGGC GCGGGCGATG GCCGCGCGGA CGACACCGAG GCCGTTCAAC ACGCCATCGA CGCCGCCCGC GACAAGACTG GCCATGGCGT CGTCTTCCTG CCGTCGGGGC GCTATCGCCT GAGCCGCAGC ATCCTGGTGC CGCCGGGCGT GCGCATCTTC GGTGTTGGGG CCAAGCGGCC TATCTTCGTG CTTGGTCCGA ACACGCCGGG ATTCCAGAAG GGCGTGGGCA CGATGATCGT GTTCACGGGC GGCGATCAGT ACGCCGTCGG CAAGATCCCC GTGCCGGTTC CGACCGCGGT GGGCGCCAAG GGCGATGTGC GCGACGCCAA CTCCGGGACC TTCTATTCGG CGCTGAGCAA TGTCGATATC GAGATCGGCG CAGGCAACCC GGCCGCGGCG GGGGTTCGGT TCCGCATGGC CCAGCACGCC TTCCTCAGCC ACATGAACTT CAACCTGGGT ACGGCCTTCG CCGGCGTCTA CCAGGCCGGC AACGTCATGG AGGACGTCCA CTTCCACGGC GGTCGCTATG GCATCGTCAC CGAAAAGACC TCGCCGGCCT GGCAGTTCAC TCTGATGGAC TCCACCTTCG ACGGCCAGCG CGACGCGGCG ATCCGCGAGC ACGAGGTGGA TCTGACGCTG GTCAATGTGG CGATGCGCAA CACGCCCGTC GGCATCGAGA TCGACCGCGG CTACAGCGAC TCCCTGTGGG GCAAGAACGT CCGCTTCGAG AACGTCTCCA AGGCCGCGGT GGTCATTTCT GCTGAAGACA ACGTCTTCAC CCAGGTGGGC TTCGAAAACG CCGTGGCGTC CAACACGCCG GTGTTCGCCC GCTTCCGCGA CAGCGGCAAG ACCGTGGCCG GCAAGGGCCG CGCCTATCGG GTGGCCTCGT TCACCTACGG CTTGACGCTG CCGGGCCTTG GCCAGATGGG CGAGTACAAG ACGCAGGCCG ATATCACCAC CCTGCCGGGC TTGTCGAAAG CCGCCGCGCC GGCCATCCGC CCGCTGCCGC CGGTGAGCGC GTGGACCAAT GTCAAGACGC TCGGCGTCAA GGGCGACGGC GAGACCGACG ACACCGCCGC GTTGCAGGCG GCGATCGAAA CCCACCGCGT GCTCTACCTG CCGATCGGCT TCTACAAGAT CACCGACACG CTGCGGCTTC GGCCGGACAC CGTGCTGATC GGCCTGCATC CGGCCATTAC CCAGCTTGTC CTGCCCGACA ACAATTCCCG CCATGCCGGG GTCGGCGCCG TCGCGCCGAT GATCGAAACG CCGAAGGGCG GCGACAACAT CCTGGCCGGG ATCGGGCTGT TCACGGGACG CGTCAATCCG CGCGCCTCGG CGCTGCTCTG GCGGTCGGGC GAGACCTCGC AGGTCACGGA CGTCAAGATC ATGGGCGGCG GCGGCACGCC GACCATGGAT GGCAAGCCGC TGGGCGCGGC CCAGGCGCGC AGCGGCGATC CGGTGGCCGA CGGCCGCTGG GACGGGCAAT ATCCCAGCAT CTGGGTGACT GATGGCGGCG GCGGGACCTT CGCCAATGTC TGGAGCCCCA ACACCTTCGC CTCGGCCGGA TTCTACATCT CCGACACCAG CACGCCGGGC CACATCTACG AGGTGTCGGT CGAGCACCAC GTCCGCAACG AGTTCGTGCT CGACAACGTC CAGAACTGGG AATTCCTGGC GCCCCAGACG 
GAGCAGGAGG TCGGTGACGG GCCGGACGCC GTGTCGCTGG AAATCCGCAA CTCGCGCAAC ATCCTGTTCG CCAACTATCA CGGCTATCGG GTGACGCGGT CGTACCACCC CGCCGAAACG GCTGTGAAGC TGTTCAACTC GTCCGACATC CGCTTCCGCA ATGTCCATAT CAACGCCGAG AGCGGCGTCG CGCTGTGCGG CGCGAGCGGC TGCGGGACCT ATCTGCGGGC CAGCAAGTAT CCCTTCGAGA ACGCGATCCA GGACAAGACC CGCAAGCTGG AGGTGCGCGA ACGTGAGTTC GCGGTGCTCG ACGTCACCAG CGAGCCCGGG CCCGCCGCGC CGGCGCGCGA CGAGAAGGTC CGGAAGCTGG AGACCGGCTT CTGGTCGATC TCGGGCGCGA CGGTGGGCGC GGACGGCGCG CTCTATTTCG TGGAGAAGCG GTTCCAGCGG ATCTATCGGT GGACGCAGGC TCGGGGTCTC GAGGTGGTGC GCGACCAAGC GCTCGACCCG GTGAACCTGG CGGTCGACCG TTCCGGCAAG CTTCTTGTGC TCTCCTCTTA CGGTCCAGAG GGAAGCGTCT ATTCGATCGA TCCGGCTGGA CCCAAGGACC AACTGACGAT GATCGCGGCG ACCCCGTCCG CGTCGCGTCC AGACGCCAAG ACGCTGCTGC CGGTCAACTG GTGGAACAAC GGCGAGTTCA GGGACCAATA CGATCCGTCG ACCGACCATT TCACCACCCT GGCCGAAATG TTCGCCCGCG ACGCCGGCGC GCCCAAGGAC AAGTCCTACG TCTCGCCTGA CGGCAGCCTT GTCCTGCCGG CCTTCCGTGT GTGGGAGCAA GGTCCTCCCG ATCACGTCGG CTGGCGCTGG TCCGACACCC TGCAGACCCA TGGCCTCATC GCGGCTGAAC CCGGCGAGCG GGTGTTCGTC ACCAACGAGT CCGAAGGCAA GACCTATAGC GGCAAGGTCG GTCCCGGCGG CGCCCTGACG GACCTGAAGG TGTTCGCCAA CCGGGGCGGG GAGAGCGTCG CGGTCGATGA CCAGGGACAG GTGTTCGTCG CCAATGGCCA GATCTTCAAA TATGCGGCCG ACGGCCAGCC CACCGGGCGC ATCGACCTGC CCGAGCGGCC GCTGCAGCTG ATCTTCGGCG GCCAGGACCG GCGCACGCTG TTCGTCCTGA CCCACCACTC GCTCTATGCG GTGACGCCAT GA
|
Protein sequence | MRAILLSAIL AASTPAVAAT ASNSVLLTAP DDPRAIMVKG AGDGRADDTE AVQHAIDAAR DKTGHGVVFL PSGRYRLSRS ILVPPGVRIF GVGAKRPIFV LGPNTPGFQK GVGTMIVFTG GDQYAVGKIP VPVPTAVGAK GDVRDANSGT FYSALSNVDI EIGAGNPAAA GVRFRMAQHA FLSHMNFNLG TAFAGVYQAG NVMEDVHFHG GRYGIVTEKT SPAWQFTLMD STFDGQRDAA IREHEVDLTL VNVAMRNTPV GIEIDRGYSD SLWGKNVRFE NVSKAAVVIS AEDNVFTQVG FENAVASNTP VFARFRDSGK TVAGKGRAYR VASFTYGLTL PGLGQMGEYK TQADITTLPG LSKAAAPAIR PLPPVSAWTN VKTLGVKGDG ETDDTAALQA AIETHRVLYL PIGFYKITDT LRLRPDTVLI GLHPAITQLV LPDNNSRHAG VGAVAPMIET PKGGDNILAG IGLFTGRVNP RASALLWRSG ETSQVTDVKI MGGGGTPTMD GKPLGAAQAR SGDPVADGRW DGQYPSIWVT DGGGGTFANV WSPNTFASAG FYISDTSTPG HIYEVSVEHH VRNEFVLDNV QNWEFLAPQT EQEVGDGPDA VSLEIRNSRN ILFANYHGYR VTRSYHPAET AVKLFNSSDI RFRNVHINAE SGVALCGASG CGTYLRASKY PFENAIQDKT RKLEVREREF AVLDVTSEPG PAAPARDEKV RKLETGFWSI SGATVGADGA LYFVEKRFQR IYRWTQARGL EVVRDQALDP VNLAVDRSGK LLVLSSYGPE GSVYSIDPAG PKDQLTMIAA TPSASRPDAK TLLPVNWWNN GEFRDQYDPS TDHFTTLAEM FARDAGAPKD KSYVSPDGSL VLPAFRVWEQ GPPDHVGWRW SDTLQTHGLI AAEPGERVFV TNESEGKTYS GKVGPGGALT DLKVFANRGG ESVAVDDQGQ VFVANGQIFK YAADGQPTGR IDLPERPLQL IFGGQDRRTL FVLTHHSLYA VTP
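
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. It is not necessarily how the 67% GC content field in this record was derived, and the helper names `clean_sequence` and `gc_content` are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group the bases into blocks."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
fragment = clean_sequence("ATGAGAGCGA TCCTATTGTC")
print(len(fragment))         # 20
print(gc_content(fragment))  # 0.45
```

Passing the full gene sequence through `clean_sequence` and then `gc_content` gives a value that can be compared with the GC content field.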