Gene Caul_1212 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1212 
Symbol 
ID5898667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1274932 
End bp1276194 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content70% 
IMG OID641561697 
Productpentapeptide repeat-containing protein 
Protein accessionYP_001682840 
Protein GI167645177 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.643735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGCGG CGCAGACCCT TGCTGGACGG CGAAGGCTCA GCCAGGCCGA ACTGGACATG 
ATCGTCGCCG CGCACGAGAA ATTCGTCACC GGCAAGCAGG GCGGCAAGCG GGCCTCGCTG
CGTTTCATGA ACCTGTCGGG CCTCGACCTG TCCTTCCGCA ATCTGGCCGA CGCCGATTTC
TCCGCCTCCA TCCTCGACGG CTGCCGCATG GTCCGCACGC GGCTGGAACG CGCCAACCTG
TTCGGCGCCG ACCTGCGCAA GGCCGACCTG CGCCAGGCGG TCCTGATCCG CGCCGACCTG
CGCGGCGCCT GCCTGCGCGG GGCCAACCTG TCCCAGGCCG ACCTGACCCA GGCCGATTTC
CGCGAAGGCC AGGTGGCCAT TCCGCATCCG CGCAAGGGCC TGGAGACCGT TCGCCACGAG
ACCCGCACCG GCGAGGTGGA CGAGGTCAAT TTCTCGGGCG CGACGCTGGA CGGCTCGCAG
TTCGCCGGCG TCTCGGCCTT CAAGGCCGAT TTCAGCGACT GCTCGTTGCG TGGAGCCAAG
CTTGCGGGCG CCAACCTCAA GGAGGCCAAC CTGACCGGCG CCATCCTGGA TGGGGCCGAT
GTCAAGGGCG CCAATCTGGA AGGCGCCAAC TTCACCGGCG CGGTGATGGC CGGCGTCGAC
ATCTCCACCG CCCGCACCCA GGGCGCGGCC ATGCAGGGCT GTCTGACGGA CTCGACCGAG
CGCGCGCTGT CGCGGGTCGA CGAGATCCTG GAGCGCTGCA TGGGCAACCA GGCCTGGTGC
AAGACCGGCG GCAAGGAAGG CGCGCCCGCT CGCCTCGACG GCGAGGACCT GCGTCCGCTC
GGCGACCGCC TCAAGGGGCT GCGCCTGACG GCGATGAGCG CCTCGGGGGC CTGCATGATC
GGCCTGGACC TGTCCGGCGC CCAACTGCAG GGCGCCAATC TGCAGAACGC CGACCTGCGC
TCGGCCAACC TGCGCGGCGC CGACCTGCGC GGGGCCAAGC TGTCGGGCGC GAACCTGACC
AAGGCCGACC TGCGACAGGC GTTCCTGTCG CCCTTGCCGC TGGGGCCGGA ACGCAAGACC
CTGGTCAATC TGAAGGCGGC GCGCCTGCGC TACGTCCAGT TCCAGGCGGC CGACCTCAGC
GAGGCGGTGC TGGACGGCGC CGACCTGCGC GGCGCGGACT TCACCGGCGC GCACCTGGGC
AAGGTGAGCC TGCGTGATTG CGACCTGACC CAGGTGCAGG GACTGGAGCT GGTTCCGGGC
TGA
 
Protein sequence
MTAAQTLAGR RRLSQAELDM IVAAHEKFVT GKQGGKRASL RFMNLSGLDL SFRNLADADF 
SASILDGCRM VRTRLERANL FGADLRKADL RQAVLIRADL RGACLRGANL SQADLTQADF
REGQVAIPHP RKGLETVRHE TRTGEVDEVN FSGATLDGSQ FAGVSAFKAD FSDCSLRGAK
LAGANLKEAN LTGAILDGAD VKGANLEGAN FTGAVMAGVD ISTARTQGAA MQGCLTDSTE
RALSRVDEIL ERCMGNQAWC KTGGKEGAPA RLDGEDLRPL GDRLKGLRLT AMSASGACMI
GLDLSGAQLQ GANLQNADLR SANLRGADLR GAKLSGANLT KADLRQAFLS PLPLGPERKT
LVNLKAARLR YVQFQAADLS EAVLDGADLR GADFTGAHLG KVSLRDCDLT QVQGLELVPG