Gene Caul_2364 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2364 
Symbol 
ID5899819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2563670 
End bp2566624 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content66% 
IMG OID641562855 
Productsarcosine oxidase alpha subunit family protein 
Protein accessionYP_001683989 
Protein GI167646326 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase)
[COG0492] Thioredoxin reductase 
TIGRFAM ID[TIGR01372] sarcosine oxidase, alpha subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.442559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCC TACGAAGCGG CGGGCAGTTG GATCGCTCGC GAGCGCTAGG CTTCAAGTTT 
GATGGTCGTG AGCTTTCCGG GTTCGCGGGT GACACGCTAG CTTCCGCGCT GGTCGCCAAC
GACGTGAAGC TGGTCGGTCG GTCCTTCAAG TATCATCGCC CCAGAGGATT GCTTTCCGCC
GGCTCGGAAG AGCCCAATGG GCTGGTCACG TTGCGCGAGG GGGCGCAGGC CGAACCGAAC
ACCCGGGCGA CGCAGGTTGA ACTGTTCGAC GGCCTGCGAG CGACCAGTCA AAATCGCTGG
CCCAATCTTC GCTTCGACCT CCTGGCCCTC AACCAGGCGG CCGCGCCGCT TCTGGTCGCG
GGGTTTTACT ACAAGACCTT CATGTGGCCG GCGGCCTTCT GGGAGCGCGT CTACGAGCCG
CTGATCCGCC GCGCGGCTGG CTTGGGGAAA CTGTCTTCGC TCCCCGATCC CGATCACTAC
GATCGCGAGC ATGGTTTCGG CGACCTTCTT GTCATTGGCG GCGGACCCGC TGGCCTGGCC
GCTGCTCTGG CCGCGGGGCG CTCTGGCCTG CGCGTGATCC TTGCAAACGA GGATTTCCTG
CTTGGCGGTC GCCTGCTTTC CGAGTCGCAT TTGATCGCCG ACAGGCCCGG CGGTGTCTGG
GCTGGCGATA CGGTCCGTGA ACTCTACACG ATGCCGAACG TGCGGATCCT CAATCGCACT
ACGATCGTCG GCGCCTACGA CGGTCGCGAA TATATCGCCG TCGAAAGGCT CACCGATCAT
CTGGCCAAGC CGGAAGGCCA TGGCGCTCGG CAGCGACTGT GGAAGATCAT CGCGCGCGAA
GCGGTCCTCG CGTCGGGCGC CATCGAAAGG CCGCTCGTCT TTGGGGGCAA TGATCGGCCG
GGCGTGATGC TGGCCTCGGC GGTGTCGACC TACATCAATC GCTTCGCGGC GCTGCCGGGC
AAGCGAGCTG TGGTTTTCAC CACGGGGGAC AGCGGATGGC GCACGGCCGC CGATCTGATC
GCCGCCGGCG CCGAGATCGC GGCGATCGCG GATGCCCGCA GCGAGGTTCC GGCGCAGGCG
CGGGCGCTGG TCTCCCGACA GGTTCCGACG TATCTCTGCG CACGGATAGG CGACGCGCAT
GGCGCGCCGG TTCGTTCGGT CGATCTGTAT GCCGGCCAAG AGCGGCATCG GATACGCGCC
GATCTCGTCG CGATGGCCGG TGGTTGGAAC CCAGCCATCG GGCTTGGTTC CAACCTGGGC
TCGCGGCCGG TCTGGTCGGA AGCGCTGGAC ACCTTCATTC TGCACAAGGG ACCGCCTGGC
CTTCGCTTGG CCGGCGCCGC GAATGGCAGA TACTCGCTCG GCGAAGCCGT GCGCGACGGC
TGGACTAGCG GAAGCGAGGC GGCCCGCGCG CTTGGCCGCC CGGCCCCCAA GTCGCCCAAC
CTAGCGGCGA GCGATGACCC ATCCTCGGCG CGAGCGCTAT GGCATGTGGC CGAGCGCCGG
GGGACGGCGT TCGTCGACTA TCAGAACGAC GTCACCGACA AGGATATCGA CCTCGCTGCT
CAGGAAGGGT TCAGGTCCGT CGAGCACATG AAGCGATACA CCACGCTTGG CATGGCGACC
GATCAAGGAA AGACCAGTGG CGTCAATGGT CATGCGCTTC TCGCCCGGGC GACGGGCAGA
TCGCTGAGCG AAACGGGCAC GATCTTGTCG CGTCCTCCGT GGCAGCCGGT GGCCATCGGC
GTGCTTGCGG GCCATCACCG CGGTCGTGAT TTCAAACCCG AGCGCCTGGC CCCCAGCCAT
CGCTGGGCCG CTGAACAAGG CGCGGCGTTC CTGGATGTCG GCCTATGGAA GCGCGCCCAA
TGGTTCCCCA GGCCTGGCGA CAAGGACTGG CGCGCCACGG TCGATCGCGA AGTTCGGCTG
ACAAGGACGG GCGTTGGCGT TTGCGACGTC TCCACTCTTG GCAAGATCGA CATCCATGGG
CCGGATGCCG GCGCGTTCCT CGATCGGCTT TACACGGGCA CCTTTTCGAC CTTGGCCGTC
GGGCGCGCCC GCTACGGCGT GATGTTGCGC GAGGACGGGT TTGTCTTTGA CGACGGGACG
ACGACCCGCT TCGCGCCAGA CCGCTATTTT CTGACGACGA CGACGGTCAA CGCCGGGCGA
GTCATGCAGC ATATTGACTA CGCAAGACAG GTGCTGTGGC CGGAACTCGA CGTCCAGGCC
GTCTCGGTAA CCGAGCAATG GGCCAGCTTC TCCATCGCTG GACCCGCGTC GCGCGCCCTG
ATCGCGGATC TGTTGTCAGG CTTCGATGTG TCCAACGCGT CATTCGCACC GATGGCCGCC
GCGGAACTGG AATGGGAAGG ACTGCCCGCC CGGCTGTTTC GGCTGTCGTT CTCCGGGGAG
CTTGCCTACG AGCTTTGCGT GCCGGCCAGC GCAGGCGACG CCTTGGTGCG CCGACTTTTT
GAGCTGGGAG CGCCATACGG CGTCACGCCC TACGGCACCG AAGCGCTTGG CGTGATGCGG
ATCGAGAAAG GGCATGTCGC GGGTCCGGAG TTGAACGGGC AAACCACAGC CGCCGATCTG
GGCCTGGGTC GGATGATGTC CACCAAGAAG GACTATATCG GCCGTGTCCT TTCGGGCCGA
CCGGCGCTCG TCGATCCGGA CCGCCCGGTG CTGGTCGGGC TGGTTCCTGT CGATCGTGGT
CAGACCTTCG CCGGCGGCGC GCATCTCGTC CCGCCTGGGC GCGCCGCTGT CGCGCGGAAT
GTGGAGGGGC ATGTCACCTC GGTCGCCTTC TCGCCGACGC TCGGTCACGG CATCGCCCTG
GCGCTTCTGG CGCGCGGGCG AGAGCGGCAT GGCCAACGCA TCGTCGCGCA TGATCCCGTA
CGCGGCATGA GCGTGGAGGC GCACGTCAGC GATCCTGTGT TCTTCGACCC AGAGGGAGCG
CGCGCCCGTG GCTGA
 
Protein sequence
MTRLRSGGQL DRSRALGFKF DGRELSGFAG DTLASALVAN DVKLVGRSFK YHRPRGLLSA 
GSEEPNGLVT LREGAQAEPN TRATQVELFD GLRATSQNRW PNLRFDLLAL NQAAAPLLVA
GFYYKTFMWP AAFWERVYEP LIRRAAGLGK LSSLPDPDHY DREHGFGDLL VIGGGPAGLA
AALAAGRSGL RVILANEDFL LGGRLLSESH LIADRPGGVW AGDTVRELYT MPNVRILNRT
TIVGAYDGRE YIAVERLTDH LAKPEGHGAR QRLWKIIARE AVLASGAIER PLVFGGNDRP
GVMLASAVST YINRFAALPG KRAVVFTTGD SGWRTAADLI AAGAEIAAIA DARSEVPAQA
RALVSRQVPT YLCARIGDAH GAPVRSVDLY AGQERHRIRA DLVAMAGGWN PAIGLGSNLG
SRPVWSEALD TFILHKGPPG LRLAGAANGR YSLGEAVRDG WTSGSEAARA LGRPAPKSPN
LAASDDPSSA RALWHVAERR GTAFVDYQND VTDKDIDLAA QEGFRSVEHM KRYTTLGMAT
DQGKTSGVNG HALLARATGR SLSETGTILS RPPWQPVAIG VLAGHHRGRD FKPERLAPSH
RWAAEQGAAF LDVGLWKRAQ WFPRPGDKDW RATVDREVRL TRTGVGVCDV STLGKIDIHG
PDAGAFLDRL YTGTFSTLAV GRARYGVMLR EDGFVFDDGT TTRFAPDRYF LTTTTVNAGR
VMQHIDYARQ VLWPELDVQA VSVTEQWASF SIAGPASRAL IADLLSGFDV SNASFAPMAA
AELEWEGLPA RLFRLSFSGE LAYELCVPAS AGDALVRRLF ELGAPYGVTP YGTEALGVMR
IEKGHVAGPE LNGQTTAADL GLGRMMSTKK DYIGRVLSGR PALVDPDRPV LVGLVPVDRG
QTFAGGAHLV PPGRAAVARN VEGHVTSVAF SPTLGHGIAL ALLARGRERH GQRIVAHDPV
RGMSVEAHVS DPVFFDPEGA RARG