Gene Caul_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1063 
Symbol 
ID5898518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1121695 
End bp1123461 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content64% 
IMG OID641561545 
Productmyosin-cross-reactive antigen 
Protein accessionYP_001682691 
Protein GI167645028 
COG category[S] Function unknown 
COG ID[COG4716] Myosin-crossreactive antigen 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.488825 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTACA GCAGCGGAAA TTACGAGGCC TTCGCCAGGG CTCGCAAGCC GAAGGGCGTG 
GAGAACAAGA CGGCGTTCTT CGTCGGCTCG GGACTGGCCA GCCTGGCCGG CGCGGCCTTC
CTGATCCGCG ACGGGCACAT GTCCGGCGAC AAGATCACCA TCCTTGAACG GCTGGCGCTG
CCGGGCGGCG CGCTGGACGG GATCAAGGAG CCCGAGAAGG GTTTCGTGAT CCGCGGCGGT
CGCGAAATGG AAGAACATTT CGAGTGCCTC TGGGACCTCT ACCGCTCGAT CCCCTCGCTC
GAGGTCGAGG GGGCCAGCGT GCTCGACGAG TTCTACTGGC TCGACAAGGA CGACCCGAAC
TTTTCCTTGT CACGGGCGAC CCAGAACCGG GGACAGCCCA TCGAGGACAT CGATCGCCTC
ACCCTCAACG ACAAGGCCCA GAAGGACCTC ATCACGCTCT TCCTGGCGAC CCGCGAGGAG
ATGGAGAACA AGCGCATCAA CGAGGTGCTG GGCGAGGACT TCCTGAAGAG CAATTTCTGG
CTCTACTGGC GCACGATGTT CGCCTTCGAG GAATGGCATT CGGCGCTGGA GATGAAGCTC
TATGTCCACC GCTTCATCCA CCACATCGGC GGCCTGGCCG ATTTCAGCGC CCTGAAGTTC
ACGCGGCGCA ACCAGTACGA GTCGCTGGTG CTGCCGCTCT ACAAATGGCT GACCGATCAT
GGCGTGACGT TCCGGTACGG CGTGGAGGTC ACCGACGTTG ACTTCGACAT CACGCCCGAG
CGCAAGCAGG CCACCCGCAT TCACTGGCTG CGAGACGGCG CCGCCGGCGG CGTCGATCTC
GGCCCCGACG ACCTGGTCTT CATGACCATC GGCTCGCTGA CCGAGAACTC CGATAACGGC
GATCACCATA CGCCCGCGAC GCTGAACGAA GGCCCCGCCC CCGCCTGGGA CCTGTGGCGC
AACATCGCGG CCAAGGATCC CGCCTTCGGT CGGCCCGACG TCTTCGGCGC CCATATCCCG
CAGACCAAGT GGGAATCGGC GACAATCACG ACGCTGGACC CCCGCATCCC GTCCTACATC
CAGAAGATCG CCAAGCGCGA CCCGTTCAGC GGCACGGTCG TCACCGGCGG CATCGTCACC
GTCAAGGATT CCAGCTGGTT GATGAGCTGG ACCGTCAGCC GCCAGCCGCA CTTCAAGCAA
CAGCCGAAAG ACCAGGTGAT CGTCTGGGTC TATTCGCTGT TCGTCGACAC CCCCGGGGAC
TTCGTCAAGA AGCCGATGCA GGATTGCACG GGCGAGGAGA TCACCCAGGA ACTGCTCTAT
CACCTCGGCG TGCCGGTCGA GGACATCGCG GAACTGGCCG CGACGGGCGC CAAGGCCGTG
CCGGTGATGA TGCCCTACAT CACCGCTTTC TTCATGCCGC GGCAGGCGGG GGATCGTCCG
GACGTCGTGC CCCAGGGGGC GGTCAATTTC GCCTTCATCG GCCAGTTCGC CGAGTCGGCC
GAACGCGACT GCATCTTCAC CACCGAGTAT TCGGTGCGCA CGCCGATGGA GGCGGTCTAC
ACGCTGCTGA ACGTGGAGCG CGGCGTTCCC GAGGTCTTCA ACTCGACCTA CGACGTGCGC
AAGCTGCTGG CCGCCGTCGG GCACGTTCGT GACGGCAAGG CCCTCGATCT TCCAGGACCG
GCGTTCATCC GCGATCTGCT GCTGAAGAAG ATCGACGCCA CCGAGATCGG CGGCCTGCTC
CGAGACGCGC ATTTGATCTC GGAGTGA
 
Protein sequence
MYYSSGNYEA FARARKPKGV ENKTAFFVGS GLASLAGAAF LIRDGHMSGD KITILERLAL 
PGGALDGIKE PEKGFVIRGG REMEEHFECL WDLYRSIPSL EVEGASVLDE FYWLDKDDPN
FSLSRATQNR GQPIEDIDRL TLNDKAQKDL ITLFLATREE MENKRINEVL GEDFLKSNFW
LYWRTMFAFE EWHSALEMKL YVHRFIHHIG GLADFSALKF TRRNQYESLV LPLYKWLTDH
GVTFRYGVEV TDVDFDITPE RKQATRIHWL RDGAAGGVDL GPDDLVFMTI GSLTENSDNG
DHHTPATLNE GPAPAWDLWR NIAAKDPAFG RPDVFGAHIP QTKWESATIT TLDPRIPSYI
QKIAKRDPFS GTVVTGGIVT VKDSSWLMSW TVSRQPHFKQ QPKDQVIVWV YSLFVDTPGD
FVKKPMQDCT GEEITQELLY HLGVPVEDIA ELAATGAKAV PVMMPYITAF FMPRQAGDRP
DVVPQGAVNF AFIGQFAESA ERDCIFTTEY SVRTPMEAVY TLLNVERGVP EVFNSTYDVR
KLLAAVGHVR DGKALDLPGP AFIRDLLLKK IDATEIGGLL RDAHLISE