Gene Caul_5202 details

Gene Information

Locus tag: Caul_5202
Symbol:
ID: 5897254
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 121074
End bp: 122498
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 70%
IMG OID: 641555305
Product: hypothetical protein
Protein accession: YP_001676636
Protein GI: 167621851
COG category: [L] Replication, recombination and repair
COG ID: [COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair
TIGRFAM ID:
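
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. The function names are illustrative and not part of any database API. The calculation assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Caul_5202.
length = gene_length_bp(121074, 122498)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed Gene Length (1425 bp) and Protein Length (474 aa).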


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.45217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGGCC GCATCGGGCG CCGTCGCGCC ATCGCGCACA TGAACCTGGC CGCCGCCCAG 
GCGGGCCTGC GGGTCGGCCA GGCCGTCGCT CACGCCACGG CCCTGATCCC GGGCCTGTTG
CTGCACGACC TCGACGCGAC CGGCGATCAA GCCGGCTTGC ATCGCCTGGC GCTGTGGGCC
CAGAAACTCT ATTCGCCCAT CGTCGCCCCC GACGGCGCCG ATGGTCTGGT GATCGACGCC
AGCGGCTGCG CCCATCTCTT CGGCGGCGAG GAGAAGATGG CCATCGCCAT CCGCGAGCGA
CTGACCAAGG CCGGGTTCAG CGCCACCCTC GCCATCGCCG ACAGCTGGGG CGGCGCCCAT
GCCTTGGCCA GGTTCAGCGG CCGAGCGATC TTCGTGGTCG ATCCCGGCGC TACCGGCCGC
GAACTTCGCA GCCTGCCCGT CGCGGCCTTG CGCCTGGGTT CCGACCTCGT CGAAGGACTG
GGGCGCCTGG GGTTCGATAC GATCGGCGAG CTGGAGGCCA CGCCCAAAGG GCCGTTGGCG
CACCGCTTGG GCCTGGAGCC CGTGCGACGC CTCGACCAGG CGCTGGCGCG TCAGGCCGAG
CCCATCGAGC CGGTGATGGC CGCCCAGACC CTGTCGGTGC GCCGCGCCTT CGCCGAGCCG
ATCGGGGCGC CCGAGACGAT GGCTCGCTAT GTCACCCAAC TCACCCACGA ACTTTGCGCG
GCTCTGGAGG CCGCCAGTCT GGGCGCCAAG CGTCTGGACG CCTGGTTCTT CCGCGTCGAC
AACCGTATCG AGGCCGCCCG GATCGGCATG GCCGCTCCGA CACGTGACGG CGCGCGCCTG
GCCAAACTCC TGTGTGAGAA GCTGGAAAGG GTCGATCCGG GATTTGGGGT CGAGAAGATC
GTGCTGGCCG CGCCCGGCGC CGAGCCCCTG ACCTACAAGC AAGGCCAAGC GCTGGGCGAC
GGCGGCGCGG GCGTGGACCT GTCGGGTCTG ATCGACACCC TGTCCAACCG GATCGGCGCC
GAACACGTCT ACCGCCTGGC CTCCGCCCAG AGCGATTTGC CCGAACGCTC GGTCAAGCGC
GTTCCGGCCT TGCAGGCGCC TGACGGCTTT TCCTGGCCGA TGGACTGGCC GCGGCCCGAC
CGGTTTTTTG CCCGCCCCGA ATCCATCGAG ACCGTCGCCC TGCTTCCCGA CGCGCCGCCG
GCGGCCTTCA CTTGGCGTGG CGCCCGCCAC CGGGTGCGCT GCGCCGACGG ACCCGAGCGG
GTGTTTGGCG AATGGTGGAA GGCTGACGAG GAGTTGGCCC GCTCGCGCGA TTATTTCCAG
GTCGAGGACG AGGCCGGCGA GCGGTTCTGG ATCTTCCGCG ACGGCGACGG CGAGGACGCC
GAAACCGGCA CGCAGCGCTG GTACATGGTC GGGGTCTTCG GATGA
 
Protein sequence
MVGRIGRRRA IAHMNLAAAQ AGLRVGQAVA HATALIPGLL LHDLDATGDQ AGLHRLALWA 
QKLYSPIVAP DGADGLVIDA SGCAHLFGGE EKMAIAIRER LTKAGFSATL AIADSWGGAH
ALARFSGRAI FVVDPGATGR ELRSLPVAAL RLGSDLVEGL GRLGFDTIGE LEATPKGPLA
HRLGLEPVRR LDQALARQAE PIEPVMAAQT LSVRRAFAEP IGAPETMARY VTQLTHELCA
ALEAASLGAK RLDAWFFRVD NRIEAARIGM AAPTRDGARL AKLLCEKLER VDPGFGVEKI
VLAAPGAEPL TYKQGQALGD GGAGVDLSGL IDTLSNRIGA EHVYRLASAQ SDLPERSVKR
VPALQAPDGF SWPMDWPRPD RFFARPESIE TVALLPDAPP AAFTWRGARH RVRCADGPER
VFGEWWKADE ELARSRDYFQ VEDEAGERFW IFRDGDGEDA ETGTQRWYMV GVFG
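
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in the permitted start codons), so every codon is simply read through the standard table.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 shares these codon-to-amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last display lines of the gene sequence in this record.
first_line = "ATGGTCGGCC GCATCGGGCG CCGTCGCGCC ATCGCGCACA TGAACCTGGC CGCCGCCCAG"
last_line = "GAAACCGGCA CGCAGCGCTG GTACATGGTC GGGGTCTTCG GATGA"
print(translate(first_line))
print(translate(last_line))
```

Translating the first and last display lines yields the first 20 and last 14 residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.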