Gene Caul_2366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2366 
Symbol 
ID5899821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2566892 
End bp2568145 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content64% 
IMG OID641562857 
Productsarcosine oxidase beta subunit family protein 
Protein accessionYP_001683991 
Protein GI167646328 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0665] Glycine/D-amino acid oxidases (deaminating) 
TIGRFAM ID[TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTATT CCTCGGCGAC CTTAATTTCG CAGTGGCTTC GAAAGCATCG CGACTGGGCT 
CCTGCCTGGG CCGACCGGCC TTTGCGCCCT CATTATGATG TCGTTGTGGT TGGCGGTGGC
GGACACGGCC TCGCGACCGC CTACTACTTG GCGAAGAACC ACGGCATCAG CCGCGTAGCG
GTGCTCGAAA AAGGCTGGAT CGGCGGCGGC AACGTTGGCC GAAACACGAC GATTGTCCGG
TCCAACTATC TCTTGGACGG CAACATCCCG TTCTACGAAT GGTCGCTTCG GCTTTGGGAA
GGGCTTGAGC AGGACCTGAA CTACAACGTC ATGGTCAGCC AGCGCGGTGT TCTAAACCTC
TATCACTCCG ACTCACAGCG AGATGAGGCC GCCAGGCGCG GCAATGCGAT GCGGCTCCAC
GGCGTGGATG CGGACCTGCT CGATCGCGAT CGCGTCCGCC GAATGGCGCC GTTCCTGGAT
TTCGAGAACG CGCGCTTTCC CATCATGGGC GGCCTGCTCC AGCCGCGCGG CGGAACGGTG
CGCCACGACG CCGTCGCCTG GGGCTACGCC AGGGCCGCGT CGGCGCTCGG GGTCGACATC
ATTCAAGACT GCGAGGTGCT CGGGATCGAG CGCGATGCGT CGGGCGCGGC GGTCGGGCTG
GAAACGAACC GCGGCCGGAT CGGCGCTGGA CGGATTGGCC TTGCGGTCGC TGGCAACAGC
TCCCGCGTCG CCGCGCTGGC CAACCTGAAG CTGCCGATCG AAAGCCACGT GCTGCAGGCC
TTCGTCTCCG AAGGTCTCAA GCCGGTCATT CCGGGCGTCA TAACCTTTGG AGCGGGGCAC
TTCTATATCA GCCAGTCCGA CAAGGGCGGA CTGGTCTTCG GCGGCGATCT CGACGGGTAC
AACAGCTATG CCCAGCGCGG GCAGATGACC ACCGTGGAGG ATGTGTGCGA GAGCGGCATG
GCTGTGATGC CGATGATCGG GCGCGCCCGC ATCCTGCGAA GCTGGGGCGG CGTCATGGAC
ATGTCGATGG ATGGTTCGCC GATCATCGAT GAAACGCCGG TGAGCAATCT CTTCCTGAAC
GCCGGCTGGT GCTACGGAGG GTTCAAGGCC ACGCCCGCAA GCGGCTGGTG CTTCGCCCAC
CTGCTGGCGA CCGGCTCTTC GCACACGCTG GCGGCGCAGT ATCGCCTCGA CCGTTTCGCA
ACCGGCGCGC TCATCAATGA AAAGGGCGAA GGTGCCCAAC CGAACCTTCA CTGA
 
Protein sequence
MRYSSATLIS QWLRKHRDWA PAWADRPLRP HYDVVVVGGG GHGLATAYYL AKNHGISRVA 
VLEKGWIGGG NVGRNTTIVR SNYLLDGNIP FYEWSLRLWE GLEQDLNYNV MVSQRGVLNL
YHSDSQRDEA ARRGNAMRLH GVDADLLDRD RVRRMAPFLD FENARFPIMG GLLQPRGGTV
RHDAVAWGYA RAASALGVDI IQDCEVLGIE RDASGAAVGL ETNRGRIGAG RIGLAVAGNS
SRVAALANLK LPIESHVLQA FVSEGLKPVI PGVITFGAGH FYISQSDKGG LVFGGDLDGY
NSYAQRGQMT TVEDVCESGM AVMPMIGRAR ILRSWGGVMD MSMDGSPIID ETPVSNLFLN
AGWCYGGFKA TPASGWCFAH LLATGSSHTL AAQYRLDRFA TGALINEKGE GAQPNLH