Gene Caul_4951 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4951 
Symbol 
ID5902413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5353431 
End bp5354876 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content70% 
IMG OID641565471 
Productputative oxidoreductase 
Protein accessionYP_001686569 
Protein GI167648906 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases 
TIGRFAM ID[TIGR01317] glutamate synthases, NADH/NADPH, small subunit
[TIGR01318] glutamate synthase small subunit family protein, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.279273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGC GCATGCTGAA ATTCACGACC GTCGCCCGCG CGACGCCCGA AAAGCGGGCG 
GCGGACGAGC GGAATGGTGA TTTCCACGAG ATCTACGCCG ACTTCATCGA CGCCAAGGCC
AGCGAGCAGG CGTCGCGTTG CTCGCAATGC GGCGTTCCGT TCTGCCAGAC CCACTGTCCG
CTGCACAACA ACATCCCCGA CTGGCTGCGG ATGACCGCCG AAGGCCGGCT GGAGGAAGCC
TACCAGCTGT CGCAGGCCAC CAATTCCATG CCGGAAGTCT GCGGGAGAAT CTGCCCGCAG
GATCGCCTCT GCGAAGGCAA CTGCGTCATA GAGCAATCGG GTCACGGCAC GGTGACGATC
GGCTCGGTCG AGCGCTACCT GACCGACAAG GCCTGGGAGA TGGGCTGGGT GAAGCCGCTG
GTCGCCGGCG CGGACCGCGG CCAGTCGGTG GGAATCATCG GCGCTGGTCC CGCCGGCCTC
GCCGCCGCCG AGCTGCTTCG CGAGCAGGGC TACGCCGTCA CCGTCTATGA CCGCCACGAC
CGGGCCGGCG GCCTGCTGAT CTATGGCATC CCCGGCTTCA AGCTGGAGAA GGACGTCGTC
GAGCGCCGCA CGAGGCGCCT GGCCGACGGC GGCGTGGTGT TCAAGCTGGG CTTCGAGGTC
GGCCGCGACG CTGCCCTGCG GGACCTGCGC GACCAGCACG ACGCCGTGCT GATCGCCGTC
GGCGTCTATG CCGCCCGCGA CCTGGTCGCG CCCGGCGCGG GCAGCCAGGG CGTCGTGCCG
GCGCTGGACT ATCTGATCGC CTCCAACCGC ACGGTCCTGG GCGACAGCGT CCCCGCCTAC
GAGAGCGGCG TGCTCAACGC GGAAGGCAAG GACGTGGTCG TGATCGGCGG CGGCGACACC
GCCATGGACT GCGTGCGCAC GGCCGTTCGC CAGGGCGCGA CCTCGGTCAC CTGCCTCTAT
CGCCGCGACA AGGCCAACAT GCCCGGCTCG ATGCGCGAAG TGTCCAACGC CGAGGAAGAG
GGCGTGGTGT TCGAATGGCT GGCCGCCCCG CGCGCCCTCG GCGGCGACGC CGAGGCCGTG
ACCGGCGTGC GCGCCATCCG CATGCGCCTG GGCGCTCCGG ACGCTTCGGG TCGCCAGAGC
CCGGAAGAGA TCGACGGCGG CGACTTCGAC CTTCCGGCCC AGCTGGTGGT CAAGGCCCTG
GGCTTCGAGC CCGAGAACCT GCCCGAACTG TGGTCGGCCC CCGACCTGAA GGTCACCCGC
TGGGGCACGG TCAAGGCCGA CGTTCGTCAC CAGATGACCA ATCTGGACGG CGTGTTCGCG
GCCGGCGACA TCGTGCGCGG CGCCTCCCTG GTGGTCTGGG CGATCAAGGA CGGCCGCGAC
GCCGCCGACG CCATGCACAA GTACCTGCAG GCCAAGGTGG CGGCGGTTTC GATCGCGGCG
GAGTAA
 
Protein sequence
MAERMLKFTT VARATPEKRA ADERNGDFHE IYADFIDAKA SEQASRCSQC GVPFCQTHCP 
LHNNIPDWLR MTAEGRLEEA YQLSQATNSM PEVCGRICPQ DRLCEGNCVI EQSGHGTVTI
GSVERYLTDK AWEMGWVKPL VAGADRGQSV GIIGAGPAGL AAAELLREQG YAVTVYDRHD
RAGGLLIYGI PGFKLEKDVV ERRTRRLADG GVVFKLGFEV GRDAALRDLR DQHDAVLIAV
GVYAARDLVA PGAGSQGVVP ALDYLIASNR TVLGDSVPAY ESGVLNAEGK DVVVIGGGDT
AMDCVRTAVR QGATSVTCLY RRDKANMPGS MREVSNAEEE GVVFEWLAAP RALGGDAEAV
TGVRAIRMRL GAPDASGRQS PEEIDGGDFD LPAQLVVKAL GFEPENLPEL WSAPDLKVTR
WGTVKADVRH QMTNLDGVFA AGDIVRGASL VVWAIKDGRD AADAMHKYLQ AKVAAVSIAA
E