Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2366 |
Symbol | |
ID | 5899821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2566892 |
End bp | 2568145 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641562857 |
Product | sarcosine oxidase beta subunit family protein |
Protein accession | YP_001683991 |
Protein GI | 167646328 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0665] Glycine/D-amino acid oxidases (deaminating) |
TIGRFAM ID | [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTATT CCTCGGCGAC CTTAATTTCG CAGTGGCTTC GAAAGCATCG CGACTGGGCT CCTGCCTGGG CCGACCGGCC TTTGCGCCCT CATTATGATG TCGTTGTGGT TGGCGGTGGC GGACACGGCC TCGCGACCGC CTACTACTTG GCGAAGAACC ACGGCATCAG CCGCGTAGCG GTGCTCGAAA AAGGCTGGAT CGGCGGCGGC AACGTTGGCC GAAACACGAC GATTGTCCGG TCCAACTATC TCTTGGACGG CAACATCCCG TTCTACGAAT GGTCGCTTCG GCTTTGGGAA GGGCTTGAGC AGGACCTGAA CTACAACGTC ATGGTCAGCC AGCGCGGTGT TCTAAACCTC TATCACTCCG ACTCACAGCG AGATGAGGCC GCCAGGCGCG GCAATGCGAT GCGGCTCCAC GGCGTGGATG CGGACCTGCT CGATCGCGAT CGCGTCCGCC GAATGGCGCC GTTCCTGGAT TTCGAGAACG CGCGCTTTCC CATCATGGGC GGCCTGCTCC AGCCGCGCGG CGGAACGGTG CGCCACGACG CCGTCGCCTG GGGCTACGCC AGGGCCGCGT CGGCGCTCGG GGTCGACATC ATTCAAGACT GCGAGGTGCT CGGGATCGAG CGCGATGCGT CGGGCGCGGC GGTCGGGCTG GAAACGAACC GCGGCCGGAT CGGCGCTGGA CGGATTGGCC TTGCGGTCGC TGGCAACAGC TCCCGCGTCG CCGCGCTGGC CAACCTGAAG CTGCCGATCG AAAGCCACGT GCTGCAGGCC TTCGTCTCCG AAGGTCTCAA GCCGGTCATT CCGGGCGTCA TAACCTTTGG AGCGGGGCAC TTCTATATCA GCCAGTCCGA CAAGGGCGGA CTGGTCTTCG GCGGCGATCT CGACGGGTAC AACAGCTATG CCCAGCGCGG GCAGATGACC ACCGTGGAGG ATGTGTGCGA GAGCGGCATG GCTGTGATGC CGATGATCGG GCGCGCCCGC ATCCTGCGAA GCTGGGGCGG CGTCATGGAC ATGTCGATGG ATGGTTCGCC GATCATCGAT GAAACGCCGG TGAGCAATCT CTTCCTGAAC GCCGGCTGGT GCTACGGAGG GTTCAAGGCC ACGCCCGCAA GCGGCTGGTG CTTCGCCCAC CTGCTGGCGA CCGGCTCTTC GCACACGCTG GCGGCGCAGT ATCGCCTCGA CCGTTTCGCA ACCGGCGCGC TCATCAATGA AAAGGGCGAA GGTGCCCAAC CGAACCTTCA CTGA
|
Protein sequence | MRYSSATLIS QWLRKHRDWA PAWADRPLRP HYDVVVVGGG GHGLATAYYL AKNHGISRVA VLEKGWIGGG NVGRNTTIVR SNYLLDGNIP FYEWSLRLWE GLEQDLNYNV MVSQRGVLNL YHSDSQRDEA ARRGNAMRLH GVDADLLDRD RVRRMAPFLD FENARFPIMG GLLQPRGGTV RHDAVAWGYA RAASALGVDI IQDCEVLGIE RDASGAAVGL ETNRGRIGAG RIGLAVAGNS SRVAALANLK LPIESHVLQA FVSEGLKPVI PGVITFGAGH FYISQSDKGG LVFGGDLDGY NSYAQRGQMT TVEDVCESGM AVMPMIGRAR ILRSWGGVMD MSMDGSPIID ETPVSNLFLN AGWCYGGFKA TPASGWCFAH LLATGSSHTL AAQYRLDRFA TGALINEKGE GAQPNLH
|
| |