Gene Caul_2701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2701 
Symbol 
ID5902555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2934574 
End bp2936199 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content66% 
IMG OID641563193 
Productzinc finger SWIM domain-containing protein 
Protein accessionYP_001684326 
Protein GI167646663 
COG category[S] Function unknown 
COG ID[COG4715] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.322732 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGTA TCCACGAAAC CCTGTCGGAC AGGCGACTGG AGCGGGCCGC CGGGTCCGCC 
GCTTTCGCGC GCGGCGCGGC CTATCACGCG CAAGGTCGCG TGGATCTATT GTCGCTGGGC
GACGATCAGG CCGTGGCTCG GGTGACGGGG TCGGAAATCT ATCGGGTGAG GCTACGTTGG
CGTGGAGGCG TCGCGGAGGG CGCTTGCGAT TGTCCCGCCT TCGATTCCAC TGATTTCTGC
AAGCATATGG TCGCCTTGGC GATCACAGCG CGAGAGCGGA CCGATGACGG GCCGGCGATC
GACCGGCGCG TGCGTCTCGT CGAGCATCTG CGCCGTCAGG GACTGGAAGC GGCGGTGGCT
CGCCTGGTGT CGCTGGCCGA GCAGCACCCG GAGGTCTGGT CCGAGATCGA GGCCGAAGCT
CAGGACGCCG TCGAAGACGA CCAGACGCTC GTGCGCCGAT ACAGGGCGGA GATCGAATCC
GCCTGCGATG TTCCAGGACC CATCGGCTAT TACAGTGTCG GGAGCTATGC CGAAGGCCTG
TTCGCCTTGC TCGATCGCCT GGAGCGCCTG AACGCCGGCG GACGAGCGAC CGCCGTGTCC
GCTCTGATGG TGCATTTCCT TGAAAACATG CAGGAGGTCT TCGAAGCCAT CGACGACTCC
GAAGGCGAGG TGACCTCCGC CGTCCAGCGG GCCGTCGAGA TTCACCTGGC TGCTTGCCGG
GAGACCAAGC CCGACCCGTT GGATCTGGCC GGATGGCTGT TCACTCAGGA GATGGACAGC
GAGTGGCCCG CCTTCGAGGA CCTTCGCATC GACTATGCCG AGGTGTTGGG CGAGGCCGGA
ATGGCTGAGT ATCGCCGACT GGCCGAGGCG GCCTGGGCGG CGGTCGCGAC CAAGGATCGA
GCCGCGCAAT ATACCTTGCG GGCGATCCTC GACCACTTCG CCTGCCAAGA CGGCGACCTC
GACGCGCGGA TCGCGCTGCG CGGCGCCGAC CTTTCCGGAC CTTACGCCTA TCTGGAGATC
ATCCAGATCT GCATGGAGGC CGAGCGGCTG GACCTAGCCC TGAAATGGGC GCGGGAGGCG
GTCTGGATTT TCGAGGACGC CCCCAATGCC CGGCTGGTGA GCCTGGCCGC CCAGCTCGAA
GAAAAGGCGG GGCGAAGCGA CGAGGCCGTG TCCATGCTCT GGCGAACCTT CGAACGGTCG
CCCGACCTCG CCCTGCTGGG CGACCTGAAA CGCCTTTCGC CGACAGATGT CATCGACAAG
GCCGCCGAGA TCCTGGAGGC CAAGGGATAC TCGGCGATGT TGTTCGAACT GCAACTGGCG
GAAGGCCGGC TGGACGCCGC CTGGAAGATC GCCGATGACC ACCCCATCGC CGACTGGCGC
CTGAAGGCGC TCGCCGACGC CAGCCACCAA ACCCATCGCC TGAAGGCCCA GGCCGCCTAT
GAGCGCCTGG CTGAGTCCAG CGTGCGTCTG GCCAATGTCG GCGCCTACGA TACGGCGATC
AAACTCATTC GCCTTCGGGG GCAGGTCTGT GATGATCCCG CTTCACAGGC GGCCTACATC
GCCGACCTCG CCACGCGTCA CAAGGCCAAG CGCACCTTCA TCCAGCGCCT GGAAGGTCTT
CGCTGA
 
Protein sequence
MSRIHETLSD RRLERAAGSA AFARGAAYHA QGRVDLLSLG DDQAVARVTG SEIYRVRLRW 
RGGVAEGACD CPAFDSTDFC KHMVALAITA RERTDDGPAI DRRVRLVEHL RRQGLEAAVA
RLVSLAEQHP EVWSEIEAEA QDAVEDDQTL VRRYRAEIES ACDVPGPIGY YSVGSYAEGL
FALLDRLERL NAGGRATAVS ALMVHFLENM QEVFEAIDDS EGEVTSAVQR AVEIHLAACR
ETKPDPLDLA GWLFTQEMDS EWPAFEDLRI DYAEVLGEAG MAEYRRLAEA AWAAVATKDR
AAQYTLRAIL DHFACQDGDL DARIALRGAD LSGPYAYLEI IQICMEAERL DLALKWAREA
VWIFEDAPNA RLVSLAAQLE EKAGRSDEAV SMLWRTFERS PDLALLGDLK RLSPTDVIDK
AAEILEAKGY SAMLFELQLA EGRLDAAWKI ADDHPIADWR LKALADASHQ THRLKAQAAY
ERLAESSVRL ANVGAYDTAI KLIRLRGQVC DDPASQAAYI ADLATRHKAK RTFIQRLEGL
R