Gene Caul_4661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4661 
Symbol 
ID5902123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5037339 
End bp5039189 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content65% 
IMG OID641565180 
Productribonucleotide-diphosphate reductase subunit alpha 
Protein accessionYP_001686279 
Protein GI167648616 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0209] Ribonucleotide reductase, alpha subunit 
TIGRFAM ID[TIGR02506] ribonucleoside-diphosphate reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.590827 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCTC TCCCTCAGGC CCGCACCGGC GGCATGAAGG TTGAGCGTCC GAAACTGGCG 
CTGGTGCGCA AGGTCGAGGT CGACCGTTCG CGCGACGCCC TGCTGACCGA TTTTGGCAAG
ACCACGCTGG AAGACCGCTA TCTGCTCCCG GGCGAGTCGT ACCAGGACAT GTTCGCCCGC
GTGTCGACGG CCTTCGCCGA CGACGCCGAC CATGCCCAGC GCGTCTACGA CTACATGAGC
AAGCTGTGGT TCATGCCGTC GACCCCGGTG CTCAGCAACG GCGGCGCCGA ACGCGGCCTG
CCGATCAGCT GCTTCCTCAA TGCGGTCAGC GACAGCCTGG ACGGCATCCT GGGCGTCTGG
AACGAGAACG TCTGGCTGGC GGCCAACGGC GGCGGCATCG GCACCTACTG GGGGGGCGTG
CGGTCGATCG GCGAGAAGGT CAAGGGTCAG GGCCAGACCA GCGGCATCAT TCCCTTCATC
CGCGTGATGG ACAGCCTGAC CCTGGCGATC AGCCAAGGGT CGCTGCGCCG CGGCTCGGCG
GCCGTCTATC TCGACATCTT CCATCCGGAG ATCGAAGAGT TCCTCGAGAT CCGCAAAGCC
TCGGGCGACT TCAACCGCAA GTCCCTGAAC CTGCACCACG GCATCTCGAT CACCGACGAG
TTCATGCACG CGGTGCGTGA CGGCCACAAG TTCGGCCTGC GCTCGCCCAA GACGGGCGAG
GTCCTGCGCG AAGTTGACGC CCGCGCCCTG TGGCAGAAAG TTCTGGAGCT GCGGCTGCAG
ACCGGCGAGC CCTACCTGAT CTTCTCCGAC ACCGTGAACC GCGCCATGCC CAAGCACCAG
CAAGAGCTGG GCCTGAAGGT TCGCCAGTCC AACCTGTGCA GTGAGATCAT GCTGCACACC
GGCGTCGACC ACCTGGGCAA CGACCGCACG GCGGTCTGCT GCCTGTCGTC GGTGAACGCC
GAGACCTTCC TGGAGTGGCG CGACCATCCG ATGTTCATCG AGGACATCAT GCGCTTCCTC
GACAACGTCC TGCAGGACTT CATCGATCGG GCGCCCGACG CGGCCGCCAC GGCCGCCTAC
GCCGCCATGC GCGAGCGTTC CGTGGGCCTG GGCCTGATGG GCTTCCACAG CTTCCTGCAG
AGCCAGAACG TGCCGTTCGA GAGCGCCCTG GCCAAGAGCT GGAACATGCG GATGTTCAAG
CACCTGCGCC GCGAAGCCGA CAAGGCGTCG ATCACCATCG GCGAAGAGAA GGGGCCGTGC
CCGGACGCCG CCGACCGCGG CTCTATGGAG CGCTTCTCGC ACAAGCTGGC CATCGCCCCG
ACCGCGTCGA TCTCGATCAT CTGCGGCGGC ACGTCGGCGG GCATCGAGCC GATCCCTGCC
AACATCTACA CCCACAAGAC CCTGTCGGGA TCGTTCGCGG TGAAGAACCC CTACCTGGAG
AAAGTGCTCG AGGAGAAGGG TCACAACACC GACGCCGTCT GGGGTTCGAT CCTCGAGAAC
GAGGGCTCGG TCCAGCACCT GGACTTCCTC AGCCAGGACG ACAAGGACGT CTACAAGACC
GCCTTCGAGC TGGACCAGCG CTGGGTGGTC GAGCTGGCCG CCGATCGCAC GCCGGAAGTC
TGCCAGAGCC AGTCGGTGAA CATCTTCCTG CCCGGCGACG TCGACAAGTG GGACCTGCAC
ATGCTGCACT GGCAGGCCTG GGAGCGCGGC GTCAAATCGC TGTACTACCT GCGCTCCAAG
TCGGTGCAGC GGGCGTCCTA CGCCGGTTCA GACGTCGCCT TGGCGGGTCC CGCCAACGGC
TTCGACGCTC CGTCCAAAAC TGACTACGAG GAATGCCTGG CCTGTCAGTA G
 
Protein sequence
MTALPQARTG GMKVERPKLA LVRKVEVDRS RDALLTDFGK TTLEDRYLLP GESYQDMFAR 
VSTAFADDAD HAQRVYDYMS KLWFMPSTPV LSNGGAERGL PISCFLNAVS DSLDGILGVW
NENVWLAANG GGIGTYWGGV RSIGEKVKGQ GQTSGIIPFI RVMDSLTLAI SQGSLRRGSA
AVYLDIFHPE IEEFLEIRKA SGDFNRKSLN LHHGISITDE FMHAVRDGHK FGLRSPKTGE
VLREVDARAL WQKVLELRLQ TGEPYLIFSD TVNRAMPKHQ QELGLKVRQS NLCSEIMLHT
GVDHLGNDRT AVCCLSSVNA ETFLEWRDHP MFIEDIMRFL DNVLQDFIDR APDAAATAAY
AAMRERSVGL GLMGFHSFLQ SQNVPFESAL AKSWNMRMFK HLRREADKAS ITIGEEKGPC
PDAADRGSME RFSHKLAIAP TASISIICGG TSAGIEPIPA NIYTHKTLSG SFAVKNPYLE
KVLEEKGHNT DAVWGSILEN EGSVQHLDFL SQDDKDVYKT AFELDQRWVV ELAADRTPEV
CQSQSVNIFL PGDVDKWDLH MLHWQAWERG VKSLYYLRSK SVQRASYAGS DVALAGPANG
FDAPSKTDYE ECLACQ