Gene Caul_3455 details


Gene Information

Locus tag: Caul_3455
Symbol:
ID: 5900910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3737904
End bp: 3740117
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 69%
IMG OID: 641563961
Product: phosphoribosylformylglycinamidine synthase II
Protein accession: YP_001685080
Protein GI: 167647417
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain
TIGRFAM ID: [TIGR01736] phosphoribosylformylglycinamidine synthase II
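
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names `span_length` and `coding_residues` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 3737904
END_BP = 3740117
GENE_LENGTH_BP = 2214
PROTEIN_LENGTH_AA = 737


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))    # 2214
print(coding_residues(GENE_LENGTH_BP))  # 737
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.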


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.821031
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCT CGCCCTCCCC CACGAAGTCG ATGGCCGACA CGGCCCGTGA ATTTGGCCTG 
AACGCGGAGG AGTACGCCGT CGTCCTCGAC CGCCTGGGCC GCGAGCCCAA CCTGGTCGAG
CTGGGGGTGT TCTCGGTGAT GTGGTCCGAG CACTGCTCGT ACAAGTCGTC CAAGAACCAG
CTGAAGAAGT TCCCCATCAC CGGCCCGCGG GTGATCTGCG GCCCCGGCGA GAACGCCGGC
GTGATCGACA TCGGCGACGG CGACGCCATC ATCTTCAAGA TGGAGAGCCA CAACCACCCC
TCCTACATCG AGCCCTACCA GGGCGCGGCG ACGGGCGTGG GCGGCATCAT GCGCGACGTC
TTCACCATGG GCGCGCGGCC GATCGCCTTG CTCAACGCCC TGCGGTTCGG CGATCCCAGC
CACCCCAAGA CCAAGCGCCT GGTCGACGGC GTGGTGGCCG GCATCGCTGG CTACGGCAAC
TGCGTGGGCG TGCCCACCGT GGCCGGCGAG ACCAATTTCC ATGCCGGCTA CAACGGCAAC
ATCCTGGTCA ACGCCATGTG CGTGGGCCTG GCCGACGCCG ACAAGATCTT CTATTCGGCC
GCGCCCGGGG CAGGCCTGTC GGTGGTCTAT TTCGGCTCGA AGACCGGCCG CGACGGCATC
CATGGCGCCA CCATGGCCAG CGCCGAGTTC GACCAGGACA GCGACGAGAA GCGCCCCACG
GTCCAGGTCG GCGACCCCTT CGCCGAGAAG CTGCTGATCG AGGCCACCCT GGAGCTGATG
GCCACCGGTG CCGTCGCCGC CATCCAGGAC ATGGGCGCGG CGGGCCTGAC CTCGTCGTCG
GTCGAGATGG CCGGCAAGGG CGGGGTCGGC ATCGAGCTGA ACATGGACAT GGTCCCCCAG
CGCGAAACCG GCATGAGCGC CTATGAGATG ATGCTGTCGG AAAGCCAGGA GCGCATGCTG
GCGGTGCTCA AGCCCGGCCG CGAGCGCGAA GGCCACGCCA TCTTCGAGAA GTGGGGCCTG
GACGCCGCCG TGATCGGCCA CACCACCGAC ACCGGCCGCC TGGTGCTCAA GCACCACGGC
GAGACCGTCT GCGACGTGCC GCTGGCCCCG CTGTTCGACG ACGCCCCGCT CTATGACCGC
CCCTGGGTGC AGCCGACGCT TCACCCGCGC CTCGACCCGG CCAAGGTCGA CGCCCCCGCC
GACTGGAACG CCGCCGTGCT CAAGGTGGTC GGCTGCCCCG ACATGGCCTC CAAGCGCTGG
CTGTGGGAGC AGTACGACCG CCACGTGATG GCCGACACCC TGGAAGACAG CGCCACCGGC
TGCGACGCCG GCATCGTGCG CATCCACGGC AAGGGCAAGG CCATTGCCGT GACCAGCGAC
GTCACGCCGC GCTATGTCCA GAACGACCCG TATGAAGGCG GCAAGCAGGC CGTGGCCGAG
GCCTGGCGCA ATCTGACGGC CGCCGGCTCG CTGCCGATCG CCATCACCGA CAACCTGAAC
TTCGGCAGCC CCGAAAAGCC CGAGACCATG GGCCAGATCG TCCGCGCCAC CGACGGCATG
GCCGAGGCCT GCCGCGTGCT GGACTTCCCG GTCGTGAGCG GCAATGTCAG CCTCTATAAC
GAGACCAACG GCGTCGCCAT TCCGCCGACC CCGACGGTGG GGGCGGTGGG CCTGATCGCC
GACTACGACC TGCGCATGGG CTTTGGCAAC GTGGCCGAGG GCGACAGCCT GGTGATCATC
GGCGAGACCC ATGGCGAACT GGGCGCCTCG ATCTATCTGC GCGAGATCCT GGGCCGCGAG
GATGGCGCCC CGCCCCCCGT CGACCTGGTC GTCGAGCGCA AGAACGGCGA TTTCGTGCGC
GGCCTGATCC CCACGGGTCT TGTCGCCGGC CTGCACGACC TGTCCGACGG CGGCCTGATC
ATCGCGGCGG CCGACATCGC CCTGGCCAGC AAGGTCGGGA TCACCTTGAA CGCCACCAGC
CAGGCTCACG CCCATCCCTA CCTGTTCGGC GAGGACCAGG CCCGCTATCT GGTCGCCACG
CCTGATCCGG ACGGTCTGCT GCTGGCGGCC AAGGAAGCCG GGGTCCACGC CAATGTCGCG
GGCGCCGTCG GCGGCGACGA CTTCGCCTCG ACGGGCCTGT TCTCCATTCC CATTGAAACC
CTGCGCGCGG CCCACGAGGC TTGGCTGCCG GGCTTCATGG GCGCCGTCGC TTAA
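
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of that cleanup together with a GC-fraction calculation; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the database may compute its reported GC content differently (for example, with different rounding).

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used as a short example.
first_line = "ATGAGCACCT CGCCCTCCCC CACGAAGTCG ATGGCCGACA CGGCCCGTGA ATTTGGCCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(100 * gc_fraction(seq)))
```

Passing the full listing to `clean_sequence` and taking its length gives a way to compare the sequence against the Gene Length field.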
 
Protein sequence
MSTSPSPTKS MADTAREFGL NAEEYAVVLD RLGREPNLVE LGVFSVMWSE HCSYKSSKNQ 
LKKFPITGPR VICGPGENAG VIDIGDGDAI IFKMESHNHP SYIEPYQGAA TGVGGIMRDV
FTMGARPIAL LNALRFGDPS HPKTKRLVDG VVAGIAGYGN CVGVPTVAGE TNFHAGYNGN
ILVNAMCVGL ADADKIFYSA APGAGLSVVY FGSKTGRDGI HGATMASAEF DQDSDEKRPT
VQVGDPFAEK LLIEATLELM ATGAVAAIQD MGAAGLTSSS VEMAGKGGVG IELNMDMVPQ
RETGMSAYEM MLSESQERML AVLKPGRERE GHAIFEKWGL DAAVIGHTTD TGRLVLKHHG
ETVCDVPLAP LFDDAPLYDR PWVQPTLHPR LDPAKVDAPA DWNAAVLKVV GCPDMASKRW
LWEQYDRHVM ADTLEDSATG CDAGIVRIHG KGKAIAVTSD VTPRYVQNDP YEGGKQAVAE
AWRNLTAAGS LPIAITDNLN FGSPEKPETM GQIVRATDGM AEACRVLDFP VVSGNVSLYN
ETNGVAIPPT PTVGAVGLIA DYDLRMGFGN VAEGDSLVII GETHGELGAS IYLREILGRE
DGAPPPVDLV VERKNGDFVR GLIPTGLVAG LHDLSDGGLI IAAADIALAS KVGITLNATS
QAHAHPYLFG EDQARYLVAT PDPDGLLLAA KEAGVHANVA GAVGGDDFAS TGLFSIPIET
LRAAHEAWLP GFMGAVA
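
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough sketch, the codon table below is built from the compact amino-acid string used in the NCBI genetic code listings (bases ordered T, C, A, G). Table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The `translate` function is a hypothetical helper written for this illustration.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG (stop = "*").
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCACCT CGCCCTCCCC CACGAAGTCG"))  # MSTSPSPTKS
```

The output matches the first ten residues of the protein sequence, and translating the last 15 bases (`GGCGCCGTCGCTTAA`) yields `GAVA`, matching its final four residues.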