Gene Caul_4979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4979 
Symbol 
ID5902441 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5383519 
End bp5384832 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content72% 
IMG OID641565500 
ProductFolC bifunctional protein 
Protein accessionYP_001686597 
Protein GI167648934 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC ATCTCCGCGC CCACGACGCC GCGCTGGCGC GGCTGCAGGC CCTGCACCCC 
AAGCTGATCG ACCTGTCTCT GGACCGCATG CGGCGGCTGT GCACGGCCCT GGGCGAGCCC
CAGAAGCGCC TGCCGCCGGT GATTCACGTG GCCGGCACCA ATGGCAAGGG CTCGACCGTC
GCCTATCTGA GGGCGATGGC CGAGGCCGCC GGCCTGACGG TTCACGTCTT CACCTCGCCC
CACCTGGTGC GGTTCGCCGA GCGCATCCGC CTGGCCGGAA CCCTGATCAC CGACGCGCAC
CTGGCCGACG TGCTGGAACG GGTCGAGGCG GCCAATGCCG GCCTGCCGAT CACTTTCTTC
GAGATCACCA CCGCCGCCGC CTTGCAGGCC TTTTCCGAGG TCCCGGCCGA CCTGTGCCTG
GTCGAGGTGG GCCTGGGCGG GGTGCTGGAC GCGACCAATG TCGTCAGCCC CGTGGTCAGC
GTCATCGCCC CGATCGACAT CGACCACCGC GAATTCCTCG GCGACACCCT GGCGGCCATC
GCCCAGGAGA AAGCCGGGAT CATCAAGCCC AACACCCCCG TCGTCTCGGC CCGCCAGGCC
GAAGAGGCCG AGCGGGTCGT CGAGCGCGAG GCCGACCTCT CCGAGGCGCC CCTGACCCTG
ATGGGCCGCG ATTTCGACGC CTGGAACGAG CGCGGCCGGC TGCTGGTGCA ACTTCAGGAC
CGCCTGCTGG ACCTGCCCGC CCCGTCCCTG CCCGGCGAGC ACCAGTTCGC CAATGCCGGC
CTGGCCGTGG CGGCCATCCT GACCCTGAAC GACCCGCGCA TCGACGAGGC CGCCATGGCC
CGGGGAATCG CGGCCACGAC CTGGCCGGCG CGGTTCCAGC GGCTGACGGC CGGTCCCCTG
GCCGAACGCG CCAAGGCGGC GGACGCCGAT CTCTGGCTGG ACGGCGGCCA TAACCCCCAT
GCCGGCCTGG CCGTGGCCCG GGCGCTGGGC GACCTGGCGG CGCGCGACGG CCGCCCGGTG
GCGCTGATCG CCGGCCTGCT GGCCAACAAG GACGCCACCG GCTTCTTCGC GCCGTTCGCG
TCGCTGAAGG CCCGGCTGTT TTCGGTGACG TTCGAAGGCC ACGCCGCCGC TAGCGCCGCC
CAGACGGCGG CGGCGGCCGA GCTGGCGGGA ATTCGCGCCC ACGCCTGCGA CAGCGTGCGC
GAGGCGCTCG ACAAGGCCCT GGCGATCGAG CCAACGCCGC ACGTGCTGAT CTGCGGCTCG
CTCTACCTGG CCGGCGAAGT GCTGGCGATG AGCCCGGAGA CCTGGCCGGT CTAA
 
Protein sequence
MTDHLRAHDA ALARLQALHP KLIDLSLDRM RRLCTALGEP QKRLPPVIHV AGTNGKGSTV 
AYLRAMAEAA GLTVHVFTSP HLVRFAERIR LAGTLITDAH LADVLERVEA ANAGLPITFF
EITTAAALQA FSEVPADLCL VEVGLGGVLD ATNVVSPVVS VIAPIDIDHR EFLGDTLAAI
AQEKAGIIKP NTPVVSARQA EEAERVVERE ADLSEAPLTL MGRDFDAWNE RGRLLVQLQD
RLLDLPAPSL PGEHQFANAG LAVAAILTLN DPRIDEAAMA RGIAATTWPA RFQRLTAGPL
AERAKAADAD LWLDGGHNPH AGLAVARALG DLAARDGRPV ALIAGLLANK DATGFFAPFA
SLKARLFSVT FEGHAAASAA QTAAAAELAG IRAHACDSVR EALDKALAIE PTPHVLICGS
LYLAGEVLAM SPETWPV