Gene Caul_2161 details

Gene Information

Locus tag: Caul_2161
Symbol:
ID: 5899616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 2343848
End bp: 2344969
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 68%
IMG OID: 641562652
Product: radical SAM domain-containing protein
Protein accession: YP_001683787
Protein GI: 167646124
COG category: [L] Replication, recombination and repair
COG ID: [COG1533] DNA repair photolyase
TIGRFAM ID:
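
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the gene length includes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2343848
end_bp = 2344969
reported_gene_length = 1122    # bp
reported_protein_length = 373  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which is not translated into an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values match the reported 1122 bp and 373 aa, and the zero remainder shows the gene length is a whole number of codons.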


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0231722
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTCAG CGGTGGTCCA TGATCGGATC ATGGTCGCTC CCGCCAATCT GCCCGGTCAT 
CCGCCGTCGA ACGTTCGAGG GCGCGGGGCC AAGTCCAATC GCACCGGCCG CTTCGAGTCC
CAGGTCAGCG AGACCTTCGA CGACGGCTGG GGCGAGGAGG ACGAGCCCGC CCAGATCGCC
ACGACGCTGC AGCCGATGAA GTCGCGGACC ATCATCGCCC GCAACGACAG TCCCGATGTC
GGCTTCGAGA GCTCGATCAA CCCCTATCGC GGCTGCAGCC ACGGCTGCAT CTATTGCTAC
GCCCGCCCGG CCCACGCCTA TCTGGGCCAT TCGCCGGGCC TGGATTTCGA GACCAGGATC
TATTTCAAGC CCGAGGCCGG CAAGCTGCTG GAGCGCGAGC TGTCCAAGAA GGGCTATGCG
CCCAAGGTCA TCCATATCGG CGGCGACACC GATCCCTACC AGCCCGACGA GCGCCAACTG
CGGGTGACGC GGGCGGTGAT CGAGACCCTG GCGCGGTTCC GCCATCCGTT CACGATCATC
ACCAAGTCGG CCCTGATCAC CCGCGACCTG GATATCCTGG GGCCGATGGG CCAGGCGGGA
CTGGCGCGAG CGGCGGTGTC GATCACCAGC CTGGACCACC GATTGTCGCG CAGCATGGAG
CCCCGGGCCG CCACGCCGAA GCGTCGCCTC GACGCCGTGC GACAGCTGAC GGCGGCGGGC
GTGCCGACCA CGGTGATGTT CGCGCCTTCG ATTCCGTCGC TGAACGACCA TGAGATGGAA
GGCGTGCTCG AGGCCGCCGC CGCCGCCGGG GCGACCACGG CCGGCTATGT CGCTGTGCGC
CTGCCGCTGG AGATCAAGGA CCTGTTCGAG GAGTGGCTGG CGGCCGAGCA CCCCGACCGC
GCCAAGCGGG TGATGTCGCT GGTTCGCCAG ATGCGCGGCG GCGCGGCCTA TAGCACCGAG
TGGGGCAAGC GGATGACCGG CGAGGGTCCG GTGGCCGAGG TAATGAGCCA GCGGTTCCAC
CTGGCGCGGA CACGCTTCGG TTTGGACCGC AAGCTGCCGC CGTTGGATCT GAGCCAGTTC
GCCGTCCCCG CCAAGGCGGG CGATCAGTTG TCGCTGTTCT AG
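
The GC content field in the Gene Information section can be recomputed from the sequence listed here. This is a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene length; `gc_fraction` is a hypothetical helper, not part of any database tooling. It strips the spaces and line breaks used in the 10-base block layout before counting.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)

# First line of the gene sequence only; paste all lines to check the full gene.
first_line = "ATGTTCTCAG CGGTGGTCCA TGATCGGATC ATGGTCGCTC CCGCCAATCT GCCCGGTCAT"
print(f"{gc_fraction(first_line):.1%}")
```

A single line is not expected to match the gene-wide figure; the whole sequence has to be passed in for a comparison with the reported value.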
 
Protein sequence
MFSAVVHDRI MVAPANLPGH PPSNVRGRGA KSNRTGRFES QVSETFDDGW GEEDEPAQIA 
TTLQPMKSRT IIARNDSPDV GFESSINPYR GCSHGCIYCY ARPAHAYLGH SPGLDFETRI
YFKPEAGKLL ERELSKKGYA PKVIHIGGDT DPYQPDERQL RVTRAVIETL ARFRHPFTII
TKSALITRDL DILGPMGQAG LARAAVSITS LDHRLSRSME PRAATPKRRL DAVRQLTAAG
VPTTVMFAPS IPSLNDHEME GVLEAAAAAG ATTAGYVAVR LPLEIKDLFE EWLAAEHPDR
AKRVMSLVRQ MRGGAAYSTE WGKRMTGEGP VAEVMSQRFH LARTRFGLDR KLPPLDLSQF
AVPAKAGDQL SLF
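
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is a dependency-free illustration, assuming translation table 11 assigns the same amino acids as the standard genetic code and differs only in which codons may act as start codons. `translate` and `CODON_TABLE` are hypothetical names chosen for this example.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI compact format).
# Assumption: table 11 assigns the same amino acids as the standard code
# and differs only in which codons may act as start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGTTCTCAG CGGTGGTCCA TGATCGGATC"))
```

For the first 30 bases this prints `MFSAVVHDRI`, the first ten residues of the protein sequence. Ambiguous bases such as `N` are not handled and would raise a `KeyError` in this sketch.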