Gene Caul_2217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2217 
Symbol 
ID5899672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2414560 
End bp2415816 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content64% 
IMG OID641562709 
Productintegrase family protein 
Protein accessionYP_001683843 
Protein GI167646180 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.664863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.427624 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCACCG ATGCCGCACT CAAGTTTTTG AAACCAAAGA AGAAACCGTA CAAGGTCGCG 
GACCGTGACG GTATTTACGT CGTGGTCTCT CCCGCCGGGT CCATCACCTT CCGGCTCGAC
TACCGGATCA ACAACCGCCG GGAGACCCTG ACGCTAGGTC GTTACGGGCG CGATGGCGTC
AGTCTGCTCA AGGCGCGCGA GCTTGGGATG GAAGCTCGGC GCAGGGTGAG GGAAGGGGTC
TCGCCGGCGA TCGAAAAGCA GCGTGACAAG GCGCGCATCA AGGCGGCCAA GACCTTCGCC
GATTTCGGGC GCAAGTGGAT CGAGGAAGGC CGCATGGCCG ATAGCACTCG CGCCATGCGC
CGCGCGATCT ACGAGCGCGA CGTCGAGCCG GCCTTCAAGA ACCGGATGCT GACCGAGATC
GAACCGGGCG ATGTTCGCGC GCTCTGCCAA CTGGTGAAGG ATCGCGGGGC GCCCGCGACC
GCCATCCACA TCCGCGATCA GATCAACCTG ATCTTCGCCT TCGCGCGCCT GCACGGCGAG
AAGGTGGAAA ACCCCGCCAA GGATGTCAGC CCGTCGTCGA TCTGGTCTTT CACGCCGCGC
GAGCGGGCGT TGTCGCCCAA GGAGATCCGG CTGCTCTATC CGCTCCTTGA GCAGGTCCCG
ACCTTGCCGA CGATCCGGCT TGGCATGAAG CTCATCCTGC TGACCCTGGT GCGCAAGAGC
GAGCTGCAGG ACGCCACCTG GGACGAGGTC GACTTCGTCA ACGCGATCTG GAGCATTCCC
GCCGCCCGGA TGAAGGCCAG CCGGGCCCAC AACATCTACC TGTCGACCCA GGCCCTCGAC
ATCATGATCG CGCTGAAGAC CTGCGCGGGA AATTCACCGT ACCTCCTGCC TTCGCGCTAT
GAGGCCGACC AGCCCATGTC GCGGGCTACG TTCAACCGCG TAACCATGAG CATTGCCGAG
CGGGCCAAGG CGCAGGGGCT CCCGCTCGCG CCGTTCACGG TTCACGACCT GCGCCGGACC
GGCTCGACCC TGCTCAACGA GATCGGATTC GAAAGCGACT GGATCGAGAA GTGCCTGGCC
CACGTCGATC GGCGAACCTC CCGTCGCGTC TACAATGTCG CCGAGTACGC CCAGCAGCGC
CGGCACATGC TTCAGGAGTG GGCGGACATG ATCGACGCCT GGGTGCGGGG CGAGCGGCAT
GTCCCGACCT TAAAACCGGC TGACATTCAC GGGGTTACGC TGGATCCACG GGCCTGA
 
Protein sequence
MLTDAALKFL KPKKKPYKVA DRDGIYVVVS PAGSITFRLD YRINNRRETL TLGRYGRDGV 
SLLKARELGM EARRRVREGV SPAIEKQRDK ARIKAAKTFA DFGRKWIEEG RMADSTRAMR
RAIYERDVEP AFKNRMLTEI EPGDVRALCQ LVKDRGAPAT AIHIRDQINL IFAFARLHGE
KVENPAKDVS PSSIWSFTPR ERALSPKEIR LLYPLLEQVP TLPTIRLGMK LILLTLVRKS
ELQDATWDEV DFVNAIWSIP AARMKASRAH NIYLSTQALD IMIALKTCAG NSPYLLPSRY
EADQPMSRAT FNRVTMSIAE RAKAQGLPLA PFTVHDLRRT GSTLLNEIGF ESDWIEKCLA
HVDRRTSRRV YNVAEYAQQR RHMLQEWADM IDAWVRGERH VPTLKPADIH GVTLDPRA