Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2364 |
Symbol | |
ID | 5899819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2563670 |
End bp | 2566624 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641562855 |
Product | sarcosine oxidase alpha subunit family protein |
Protein accession | YP_001683989 |
Protein GI | 167646326 |
COG category | [E] Amino acid transport and metabolism [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0404] Glycine cleavage system T protein (aminomethyltransferase) [COG0492] Thioredoxin reductase |
TIGRFAM ID | [TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.442559 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCGCC TACGAAGCGG CGGGCAGTTG GATCGCTCGC GAGCGCTAGG CTTCAAGTTT GATGGTCGTG AGCTTTCCGG GTTCGCGGGT GACACGCTAG CTTCCGCGCT GGTCGCCAAC GACGTGAAGC TGGTCGGTCG GTCCTTCAAG TATCATCGCC CCAGAGGATT GCTTTCCGCC GGCTCGGAAG AGCCCAATGG GCTGGTCACG TTGCGCGAGG GGGCGCAGGC CGAACCGAAC ACCCGGGCGA CGCAGGTTGA ACTGTTCGAC GGCCTGCGAG CGACCAGTCA AAATCGCTGG CCCAATCTTC GCTTCGACCT CCTGGCCCTC AACCAGGCGG CCGCGCCGCT TCTGGTCGCG GGGTTTTACT ACAAGACCTT CATGTGGCCG GCGGCCTTCT GGGAGCGCGT CTACGAGCCG CTGATCCGCC GCGCGGCTGG CTTGGGGAAA CTGTCTTCGC TCCCCGATCC CGATCACTAC GATCGCGAGC ATGGTTTCGG CGACCTTCTT GTCATTGGCG GCGGACCCGC TGGCCTGGCC GCTGCTCTGG CCGCGGGGCG CTCTGGCCTG CGCGTGATCC TTGCAAACGA GGATTTCCTG CTTGGCGGTC GCCTGCTTTC CGAGTCGCAT TTGATCGCCG ACAGGCCCGG CGGTGTCTGG GCTGGCGATA CGGTCCGTGA ACTCTACACG ATGCCGAACG TGCGGATCCT CAATCGCACT ACGATCGTCG GCGCCTACGA CGGTCGCGAA TATATCGCCG TCGAAAGGCT CACCGATCAT CTGGCCAAGC CGGAAGGCCA TGGCGCTCGG CAGCGACTGT GGAAGATCAT CGCGCGCGAA GCGGTCCTCG CGTCGGGCGC CATCGAAAGG CCGCTCGTCT TTGGGGGCAA TGATCGGCCG GGCGTGATGC TGGCCTCGGC GGTGTCGACC TACATCAATC GCTTCGCGGC GCTGCCGGGC AAGCGAGCTG TGGTTTTCAC CACGGGGGAC AGCGGATGGC GCACGGCCGC CGATCTGATC GCCGCCGGCG CCGAGATCGC GGCGATCGCG GATGCCCGCA GCGAGGTTCC GGCGCAGGCG CGGGCGCTGG TCTCCCGACA GGTTCCGACG TATCTCTGCG CACGGATAGG CGACGCGCAT GGCGCGCCGG TTCGTTCGGT CGATCTGTAT GCCGGCCAAG AGCGGCATCG GATACGCGCC GATCTCGTCG CGATGGCCGG TGGTTGGAAC CCAGCCATCG GGCTTGGTTC CAACCTGGGC TCGCGGCCGG TCTGGTCGGA AGCGCTGGAC ACCTTCATTC TGCACAAGGG ACCGCCTGGC CTTCGCTTGG CCGGCGCCGC GAATGGCAGA TACTCGCTCG GCGAAGCCGT GCGCGACGGC TGGACTAGCG GAAGCGAGGC GGCCCGCGCG CTTGGCCGCC CGGCCCCCAA GTCGCCCAAC CTAGCGGCGA GCGATGACCC ATCCTCGGCG CGAGCGCTAT GGCATGTGGC CGAGCGCCGG GGGACGGCGT TCGTCGACTA TCAGAACGAC GTCACCGACA AGGATATCGA CCTCGCTGCT CAGGAAGGGT TCAGGTCCGT CGAGCACATG AAGCGATACA CCACGCTTGG CATGGCGACC GATCAAGGAA AGACCAGTGG CGTCAATGGT CATGCGCTTC TCGCCCGGGC GACGGGCAGA TCGCTGAGCG AAACGGGCAC GATCTTGTCG CGTCCTCCGT GGCAGCCGGT GGCCATCGGC GTGCTTGCGG GCCATCACCG CGGTCGTGAT TTCAAACCCG AGCGCCTGGC CCCCAGCCAT CGCTGGGCCG CTGAACAAGG CGCGGCGTTC CTGGATGTCG GCCTATGGAA GCGCGCCCAA TGGTTCCCCA GGCCTGGCGA CAAGGACTGG CGCGCCACGG TCGATCGCGA AGTTCGGCTG ACAAGGACGG GCGTTGGCGT TTGCGACGTC TCCACTCTTG GCAAGATCGA CATCCATGGG CCGGATGCCG GCGCGTTCCT CGATCGGCTT TACACGGGCA CCTTTTCGAC CTTGGCCGTC GGGCGCGCCC GCTACGGCGT GATGTTGCGC GAGGACGGGT TTGTCTTTGA CGACGGGACG ACGACCCGCT TCGCGCCAGA CCGCTATTTT CTGACGACGA CGACGGTCAA CGCCGGGCGA GTCATGCAGC ATATTGACTA CGCAAGACAG GTGCTGTGGC CGGAACTCGA CGTCCAGGCC GTCTCGGTAA CCGAGCAATG GGCCAGCTTC TCCATCGCTG GACCCGCGTC GCGCGCCCTG ATCGCGGATC TGTTGTCAGG CTTCGATGTG TCCAACGCGT CATTCGCACC GATGGCCGCC GCGGAACTGG AATGGGAAGG ACTGCCCGCC CGGCTGTTTC GGCTGTCGTT CTCCGGGGAG CTTGCCTACG AGCTTTGCGT GCCGGCCAGC GCAGGCGACG CCTTGGTGCG CCGACTTTTT GAGCTGGGAG CGCCATACGG CGTCACGCCC TACGGCACCG AAGCGCTTGG CGTGATGCGG ATCGAGAAAG GGCATGTCGC GGGTCCGGAG TTGAACGGGC AAACCACAGC CGCCGATCTG GGCCTGGGTC GGATGATGTC CACCAAGAAG GACTATATCG GCCGTGTCCT TTCGGGCCGA CCGGCGCTCG TCGATCCGGA CCGCCCGGTG CTGGTCGGGC TGGTTCCTGT CGATCGTGGT CAGACCTTCG CCGGCGGCGC GCATCTCGTC CCGCCTGGGC GCGCCGCTGT CGCGCGGAAT GTGGAGGGGC ATGTCACCTC GGTCGCCTTC TCGCCGACGC TCGGTCACGG CATCGCCCTG GCGCTTCTGG CGCGCGGGCG AGAGCGGCAT GGCCAACGCA TCGTCGCGCA TGATCCCGTA CGCGGCATGA GCGTGGAGGC GCACGTCAGC GATCCTGTGT TCTTCGACCC AGAGGGAGCG CGCGCCCGTG GCTGA
|
Protein sequence | MTRLRSGGQL DRSRALGFKF DGRELSGFAG DTLASALVAN DVKLVGRSFK YHRPRGLLSA GSEEPNGLVT LREGAQAEPN TRATQVELFD GLRATSQNRW PNLRFDLLAL NQAAAPLLVA GFYYKTFMWP AAFWERVYEP LIRRAAGLGK LSSLPDPDHY DREHGFGDLL VIGGGPAGLA AALAAGRSGL RVILANEDFL LGGRLLSESH LIADRPGGVW AGDTVRELYT MPNVRILNRT TIVGAYDGRE YIAVERLTDH LAKPEGHGAR QRLWKIIARE AVLASGAIER PLVFGGNDRP GVMLASAVST YINRFAALPG KRAVVFTTGD SGWRTAADLI AAGAEIAAIA DARSEVPAQA RALVSRQVPT YLCARIGDAH GAPVRSVDLY AGQERHRIRA DLVAMAGGWN PAIGLGSNLG SRPVWSEALD TFILHKGPPG LRLAGAANGR YSLGEAVRDG WTSGSEAARA LGRPAPKSPN LAASDDPSSA RALWHVAERR GTAFVDYQND VTDKDIDLAA QEGFRSVEHM KRYTTLGMAT DQGKTSGVNG HALLARATGR SLSETGTILS RPPWQPVAIG VLAGHHRGRD FKPERLAPSH RWAAEQGAAF LDVGLWKRAQ WFPRPGDKDW RATVDREVRL TRTGVGVCDV STLGKIDIHG PDAGAFLDRL YTGTFSTLAV GRARYGVMLR EDGFVFDDGT TTRFAPDRYF LTTTTVNAGR VMQHIDYARQ VLWPELDVQA VSVTEQWASF SIAGPASRAL IADLLSGFDV SNASFAPMAA AELEWEGLPA RLFRLSFSGE LAYELCVPAS AGDALVRRLF ELGAPYGVTP YGTEALGVMR IEKGHVAGPE LNGQTTAADL GLGRMMSTKK DYIGRVLSGR PALVDPDRPV LVGLVPVDRG QTFAGGAHLV PPGRAAVARN VEGHVTSVAF SPTLGHGIAL ALLARGRERH GQRIVAHDPV RGMSVEAHVS DPVFFDPEGA RARG
|
| |