Gene Caul_3099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3099 
Symbol 
ID5900554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3362929 
End bp3364623 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content74% 
IMG OID641563602 
ProductDNA repair protein RecN 
Protein accessionYP_001684724 
Protein GI167647061 
COG category[L] Replication, recombination and repair 
COG ID[COG0497] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00634] DNA repair protein RecN 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.545519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.794323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCG GCCTTTCCAT CCGCGACGTC GTGCTCATCG AAAGCCTGGA CCTGGCCATC 
GGTCCGGGGC TGACCGCCCT GACCGGCGAG ACGGGGGCGG GCAAGTCGAT CATCCTGGAC
GCCCTGGGCC TGGCCACCGG GGCTCGGGCT GACGCCGGGC TCGTTCGTCG GGGGGCTGCA
GGCCACGCCA GCGCCACCGC CATCTTCGCC CTGCCGGCCG ACCACGCCGC CTTCGCCTAT
CTCGACGACA AGGGCCTGGA CTACGCCCGC GACGAGGATC TGGTCCTGCG CCGCCAGCTG
TCGCCCGACG GCCGTAGCCG CGCCTTCGTC AACGATCAGG CCACCAGCAT CGGGGTGCTC
AAGGACCTGG GCGCGCTGCT GCTCGAGGTT CACGGTCAGC ACGAGACGGT CGGCCTGCTG
GACGCGCGTA CCCACCGTGC TCTGCTGGAC GCCTTCGGCC TGGTCAGCGT CGGGACCGTC
GGCTCGGCCT GGAGCGCCTG GCGGGCCGCC CGCGAGAAGG CCGCCGCCCT GCGCGACCTG
GCTGACCGCG CCGCCGTCGA GACCGAGGAA CTGACCCTTC GCCTGTCGGA GCTGGATCGC
CTGGATCCCC GTGAAGGCGA GGAGACCGCC CTGGCCGAGG AGCGCGCCCT GCTGGGCTCG
GCCGAGAAGG CCCTGGCCGA CATCGCCGCC GCCTCCGACG CCCTGGGCGG CGACGCGTTG
TCGGGCAAGC TGGCCAGCGC CTTCCGCGCC CTAGAACGGG CCCGCGACCG CGCCATCCAG
GCCGGCGCCC CCGCCGACGG CCCAGCCGTC ACAAAGCTGG CCGCCGCCTG CGAGGCGGTG
GACCGCGCCC TGGTCGAGGC CCAGGAGGCC GCCGCCGCCA TCGACGCCGC CGCCGAGGGT
TTCGAGTTCG AGCCCGACCG GCTGGACAAG GCCGAGGAAC GGCTGTTCGC CCTGCGCGCC
ATGGCTCGCA AGCTGAACAT CGCCACCGAG CTGCTGCCGG GCAAGCGCGC CGAGTTCGCC
GCCCAGCTGC AGGCCATCGA GACCTCCAGC GAGGCCCTGA AGGCCGCCGA ACTGGAGGCC
GCCGACGCCC GCGACGCCTA TCTGTTCGCC GCCCAGGCCC TGTCGGCCGA GCGTCGCGCG
GCCGGCGACC GCCTGGCCGC CGCCGTCGAG AGCGAGCTGA CGCCGCTGAA ACTGGAAAAG
GCCCGGTTCC GCGTCGCCAT TGAACCGCTC GGCGAGGACC GCGCCGGCCC GATGGGTGTC
GACCGCGTGG CCTTCGAGAT CTCGACCAAT CCCGGCGCGC CGTTCGGCCC GATGGAGGCC
ATCGCCTCTG GCGGCGAACT GGCCCGCTTC GCCCTGGCCC TGAAGGCCGC CCTGGCGGGT
CGCGAGGGAC CCCAGCCGCT GATGATCTTC GATGAAGTCG ACCAGGGGGT CGGCGGGGCC
GTGGCCGACG CCGTGGGCCT GCGGCTCAAG CGTCTGGCGG CGAACGCCCA GGTTCTGGTG
GTCACCCACT CGGCCCAGGT CGCCGCCCGG GCCGACGCCC ACTGGCGGAT CGCCAAGTCC
GGCGACGACA CCGCGATCCG CACCCGGGTC GAGCCCCTGT CGCCGGCCCA GCGCCAGGAA
GAGATCGCCC GCATGCTGTC GGGGGCCAAG GTCACCGAGG CGGCGCGAGC CGCGGCCCGG
GCGTTGATTG GGTAG
 
Protein sequence
MLIGLSIRDV VLIESLDLAI GPGLTALTGE TGAGKSIILD ALGLATGARA DAGLVRRGAA 
GHASATAIFA LPADHAAFAY LDDKGLDYAR DEDLVLRRQL SPDGRSRAFV NDQATSIGVL
KDLGALLLEV HGQHETVGLL DARTHRALLD AFGLVSVGTV GSAWSAWRAA REKAAALRDL
ADRAAVETEE LTLRLSELDR LDPREGEETA LAEERALLGS AEKALADIAA ASDALGGDAL
SGKLASAFRA LERARDRAIQ AGAPADGPAV TKLAAACEAV DRALVEAQEA AAAIDAAAEG
FEFEPDRLDK AEERLFALRA MARKLNIATE LLPGKRAEFA AQLQAIETSS EALKAAELEA
ADARDAYLFA AQALSAERRA AGDRLAAAVE SELTPLKLEK ARFRVAIEPL GEDRAGPMGV
DRVAFEISTN PGAPFGPMEA IASGGELARF ALALKAALAG REGPQPLMIF DEVDQGVGGA
VADAVGLRLK RLAANAQVLV VTHSAQVAAR ADAHWRIAKS GDDTAIRTRV EPLSPAQRQE
EIARMLSGAK VTEAARAAAR ALIG